TCG ACT ATG 
ser thr met 

61 

TCA AGC TCC 
ser ser ser 




121 
AGA 
arg 

181 
CTG 
leu 

241 
AGC 
ser 

301 
ATC 
lie 

361 
GAC 
asp 

421 
GGA 
gly 

481 
AAA 

lys 

541 
GAC 
asp 

601 
CTC 
leu 

661 
CTT 
leu 

721 
ACC 
thr 

781 
GGG 
gly 

841 
GGC 
gly 



GTA CCC 
val pro 



GAT GCC 
asp ala 



CTG CTA 
leu leu 



GCC GCC 
ala ala 



TTT ACA 
phe thr 



GCC GGT 
ala gly 



ACA AGT 
thr ser 



ACT ACT 
thr thr 



ATT AGG 
ile arg 



TTC ATC 
phe ile 



CAA GTC 
gin val 



GAA GGG 
glu gly 



CTA AAT 
leu asn 



AAT 

asn ala 



TGC TCC 
cys ser 



GCC AGC 
ala ser 



GAA GGG 
glu gly 

AGT TCT 
ser ser 



CTT TAG 
leu tyr 



GTT GCA 
vai ala 



GCC ATT 

ala ile 



GGA ATA 
gly ile 



AGC TTC 
ser phe 



AGC ACT 
ser thr 



GAA GGA 
glu gly 

CAC GAC 
his asp 



CTA CCT 
leu pro 



GAT ACT 
asp thr 



ACC CAG 
thr gin 



GTG CTG 
val leu 



CGC AAC 
arg asn 



AAG TGG 
lys trp 



TGT GTG 
cys val 



CTG CAG 
leu gin 



GCC AGA 
ala arg 



GAA AAT 
glu asn 



CTC ATG 
leu met 



ATT GCT 
ile ala 



TTA TGT 
leu cys 



AAG TCA 
lys ser 



GGG GAG 
gly glu 



TTG AGA 
leu arg 



GCT CCA 
ala pro 



CAG GAC 
gin asp 



TCC TCA 
ser ser 



ATC AGC 
ile ser 



CTG GAC 
leu asp 



TTA GTT 
leu val 



CGC AGA 
arg arg 



AAT GTG 
asn val 



TCT TCC 
ser ser 



ATC TGC 
ile cys 



GAC CTG 
asp leu 



TTT GAG 
phe glu 



AGG GAT 
arg asp 



ATG ATG 
met met 



TGG AAG 
trp lys 



31 

ACA TCT 
thr ser 

91 

CAG AAG 
gin lys 



151 

CCG GGG 
pro gly 

211 
GAA GTA 
glu val 

271 
CCA CGC 
pro arg 

331 
GGC ATC 
gly ile 

391 
TAC CGC 
tyr arg 

451 
GCT CTG 

ala leu 

511 
TTG CCC 
leu pro 

571 
TTC CTC 
phe leu 

631 
CCA TCA 
pro ser 

691 
TAT AAG 
tyr lys 

751 
GTC AAG 
val lys 

811 
CGC CGC 
arg arg 

871 
AAA GAG 
lys glu 



CCT TGT CCT TCC ATA TCT TCC CAG AAC 
pro cys pro ser ile ser ser gin asn 



ATC GCC AGC ATG TTC GAT CGG ACT TCC 
ile ala ser met phe asp arg thr ser 

Cadherin 
! XX EC motif XX I 
CTC CTC TTC ACA GAA CTG GCT GCT GCC 
leu leu phe thr glu leu ala ala ala 



CAA AGG AAA GCT GTC AGT GCA ATT CAC 
gin arg lys ala val ser ala ile his 



TGT GTC AAA CCA GAG GTG AAG GTC AAA 
cys val lys pro glu val lys val lys 



ATT TTG GAT GCT TTG CCA CAG CTC TGT 
ile leu asp ala leu pro gin leu cys 



ACC AGT GGC TCG GAT GAA GAA CAA GAA 
thr ser gly ser asp glu glu gin glu 



GCC ATA GCA GGG AAT AAT TTC AAT TTG 
ala ile ala gly asn asn phe asn leu 



TAT AAG CAG TAC AAC ATG CTG AAC GCG 
tyr lys gin tyr asn met leu asn ala 



TGG ATC ATG AAA AAT GCT GAT CAG AGC 
trp ile met lys asn ala asp gin ser 



ACG CAG CTC AAC AGG ATT TTA GAT CTA 
thr gin leu asn arg ile leu asp leu 



GGA AAA CAG AGT TCT GAC AAA GTC AGT 
gly lys gin ser ser asp lys val ser 



GCC CGG CTG GAA GAG GCT TTG CTG CGT 
ala arg leu glu glu ala leu leu arg 



CGG GCT CCA GGG AAC GAC CGA TTT CCA 
arg ala pro gly asn asp arg phe pro 



CAG ACA CAT TGG CGG CAA GCT AAT GAG 
gin thr his trp arg gin ala asn glu 
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901 931 

AAG CTA GAT AAA ACA AAG GCC GAG TTA GAT CAA GAA GCC TTG ATC AGT GGC AAT CTG GCT 
lys leu asp lys thr lys ala glu leu asp gin glu ala leu ile ser gly asn leu ala 

961 991 

ACA GAA GCA CAT TTA ATC ATC CTG GAT ATG CAG GAA AAC ATT ATC CAG GCG AGC TCG GCT 
thr glu ala his leu ile ile leu asp met gin glu asn ile ile gin ala ser ser ala 

1021 1051 

CTG GAC TGT AAA GAC AGC CTG CTG GGA GGT GTT CTG AGG GTG CTG GTG AAT TCT CTG AAC 
leu asp cys lys asp ser leu leu gly gly val leu arg val leu val asn ser leu asn 

1081 1111 

TGT GAT CAG AGT ACC ACC TAC CTG ACT CAC TGC TTT GCA ACA CTC CGT GCT CTC ATC GCC 
cys asp gin ser thr thr tyr leu thr his cys phe ala thr leu arg ala leu ile ala 

1141 1171 

AAG TTT GGA GAC TTA CTC TTC GAA GAG GAG GTG GAA CAG TGT TTC GAC CTA TGT CAC CAA 
lys phe gly asp leu leu phe glu glu glu val glu gin cys phe asp leu cys his gin 

1201 1231 

GTC CTG CAC CAC TGC AGC AGC AGC ATG GAT GTC ACC CGG AGC CAA GCC TGT GCC ACQ CTT 
val leu his his cys ser ser ser met asp val thr arg ser gin ala cys ala thr leu 

1261 1291 

TAC CTC CTC ATG AGG TTC AGT TTT GGA GCC ACC AGT AAT TTT GCA AGA GTA AAG ATG CAA 
tyr leu leu met arg phe ser phe gly ala thr ser asn phe ala arg val lys met gin 

1321 1351 

GTA ACC ATG TCC CTG GCA TCT TTG GTG GGA AGA GCA CCA GAC TTT AAT GAA GAG CAC CTG 
val thr met ser leu ala ser leu val gly arg ala pro asp phe asn glu glu his leu 

1381 1411 

AGA AGA TCC TTG AGG ACA ATT TTG GCC TAT TCA GAA GAG GAC ACA GCC ATG CAG ATG ACT 
arg arg ser leu arg thr ile leu ala tyr ser glu glu asp thr ala met gin met thr 

1441 1471 

CCT TTT CCC ACC CAG GTG GAG GAA CTT CTC TGT AAT CTG AAT AGC ATC TTA TAT GAC ACA 
pro phe pro thr gin val glu glu leu leu cys asn leu asn ser ile leu tyr asp thr 

1501 1531 

GTG AAA ATG AGG GAA TTT CAG GAA GAT CCT GAG ATG CTT ATG GAT CTC ATG TAC AGA ATT 
val lys met arg glu phe gin glu asp pro glu met leu met asp leu met tyr arg ile 

1561 1591 

GCC AAG AGT TAC CAG GCA TCT CCT GAT CTG CGG CTG ACC TGG CTC CAG AAC ATG GCA GAG 
ala lys ser tyr gin ala ser pro asp leu arg leu thr trp leu gin asn met ala glu 

1521 I xxxxxxxxxxxxxxxxxxxxxx transmembrane domain xxxxxxx 

AAA CAC ACC AAG AAG AAG TGC TAC ACG GAG GCT GCC ATG TGC CTG GTG CAC GCC GCT GCG 
lys his thr lys lys lys cys tyr thr glu ala ala met cys leu val his ala ala ala 

xxxxxxxxxxxxxxxxxxxxxxx I 1711 

TTA GTG GCT GAG TAT CTG AGC ATG CTG GAG GAC CAC AGC TAC CTG CCC GTG GGC AGT GTC 
leu val ala glu tyr leu ser met leu glu asp his ser tyr leu pro val gly ser val 

1741 1771 

AGC TTC CAG AAT ATT TCT TCC AAT GTG CTG GAG GAG TCT GTG GTC TCT GAG GAC ACC CTG 
ser phe gin asn ile ser ser asn val leu glu glu ser val val ser glu asp thr leu 

1801 1831 

TCA CCT GAC GAG GAT GGG GTG TGC GCA GGC CAG TAC TTC ACC GAG AGT GGC CTG GTA GGC 
ser pro asp glu asp gly val cys ala gly gin tyr phe thr glu ser gly leu val gly 
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1861 1891 

CTC CTG GAG CAG GCC GCG GAG CTC TTC AGC ACG GGA GGC TTA TAT GAG ACA GTT AAT GAG 

leu leu glu gin ala ala glu leu phe ser thr gly gly leu tyr glu thr val asn glu 

1921 1951 

GTC TAG AAG CTG GTC ATC CCC ATC CTA GAA GCG CAT CGA GAA TTC CGG AAG CTG ACA CTC 

val tyr lys leu val lie pro ile leu glu ala his arg glu phe arg lys leu thr leu 

1981 2011 

ACT CAC AGC AAG CTG CAG AGA GCC TTC GAC AGC ATC GTT AAC AAG GAT CAT AAG AGA ATG 

thr his ser lys leu gin arg ala phe asp ser ile val asn lys asp his lys arg met 

2041 Ixxxxx ITAM xxxx i 2071 

TTT GGA ACC TAC TTC CGA GTT GGT TTC TTT GGA TCC AAA TTT GGG GAT TTG GAT GAA CAG 

phe gly thr tyr phe arg val gly phe phe gly ser lys phe gly asp leu asp glu gin 

2101 ' 2131 

GAG TTT GTC TAC AAA GAG CCT GCA ATT ACC AAG CTT CCT GAG ATC TCA CAT AGA CTA GAG 

glu phe val tyr lys glu pro ala ile thr lys leu pro glu ile ser his arg leu glu 

2161 2191 

GCA TTT TAT GGT CAA TGT TTT GGT GCA GAA TTT GTG GAA GTG ATT AAA GAC TCC ACT CCT 

ala phe tyr gly gin cys phe gly ala glu phe val glu val ile lys asp ser thr pro 

2221 2251 

GTG GAC AAA ACC AAG TTG GAT CCT AAC AAG GCC TAC ATA CAG ATC ACT TTT GTG GAG CCC 

val asp lys thr lys leu asp pro asn lys ala tyr ile gin ile thr phe val glu pro 

2281 2311 

TAC TTT GAT GAG TAT GAG ATG AAA GAC AGG GTC ACA TAC TTT GAG AAG AAT TTC AAC CTC 

tyr phe asp glu tyr glu met lys asp arg val thr tyr phe glu lys asn phe asn leu 

2341 2371 

CGG AGG TTC ATG TAC ACC ACC CCG TTC ACC CTG GAG GGG CGG CCT CGG GGA GAG CTG CAT 

arg arg phe met tyr thr thr pro phe thr leu glu gly arg pro arg gly glu leu his 

2401 2431 

GAG CAG TAC AGA AGG AAC ACA GTC CTG ACC ACT ATG CAC GCC TTC CCC TAC ATC AAG ACC 

glu gin tyr arg arg asn thr val leu thr thr met his ala phe pro tyr ile lys thr 

2461 2491 I xxxxxxxxxxxxxxxxxxxxxxx 

AGG ATC AGC GTC ATC CAG AAG GAG GAG TTT GTT TTG ACA CCG ATT GAA GTT GCC ATT GAA 

arg ile ser val ile gin lys glu glu phe val leu thr pro ile glu val ala ile glu 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx Coiled coil 1 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

GAC ATG AAG AAG AAG ACC CTG CAG TTA GCA GTT GCC ATT AAC CAG GAG CCG CCT GAT GCA 

asp met lys lys lys thr leu gin leu ala val ala ile asn gin glu pro pro asp ala 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx I 2 611 

AAG ATG CTT CAG ATG GTG CTG CAA GGC TCT GTG GGA GCT ACT GTA AAT CAG GGA CCA CTG 

lys met leu gin met val leu gin gly ser val gly ala thr val asn gin gly pro leu 

2541 2671 

GAA GTA GCC CAA GTG TTT TTG GCT GAA ATT CCT GCT GAT CCA AAA CTC TAT CGA CAT CAC 

glu val ala gin val phe leu ala glu ile pro ala asp pro lys leu tyr arg his his 

2701 2731 i xxxxxxxxxxx 

AAC AAG TTG AGG TTA TGC TTT AAG GAA TTC ATC ATG AGA TGT GGT GAA GCT GTA GAG AAA 

asn lys leu arg leu cys phe lys glu phe ile met arg cys gly glu ala val glu lys 

xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx Coiled coil 2 xxxxxxxxxxxxxxxxxxxxxxxxxxxxxxx 

AAC AAG CGT CTC ATC ACG GCA GAC CAG AGG GAA TAT CAG CAG GAA CTC AAA AAG AAC TAT 

asn lys arg leu ile thr ala asp gin arg glu tyr gin gin glu leu lys lys asn tyr 



FIG. 1 (3 of 5) 



XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXJ 

AAC AAG CTA AAA GAG AAC CTC AGG CCA ATG 
asn lys leu lys glu asn leu arg pro met 

2881 

CCA ATA TTC AGA GTT GAG AGT CAA AAG AGG 
pro lie phe arg val glu ser gin lys arg 

2941 

TGT GAA ACC CAG TTG TCA CAG GGC AGC TAA 
cys glu thr gin leu ser gin gly ser OCH 

3001 

GCC CTG CAA CCC TGG AGA AGG ACT TGC TGG 
ala leu gin pro trp arg arg thr cys trp 

3061 ; 
GAC TGA CTG TAG ACT CCC TGA TCA GCC AGC 
asp STP 

3121 

TGG AAT TAT TCC CAA ATG GAC TCT GAC CAG 
3181 

GAG GAT GGG TAG TCA GGC ATG ACT GCG TAT 
3241 

AAA CAA GGC ATA AGC AGC TTC TCC TGC TGA 
3301 

TTT CCT CTG GCC CAT ATT TGA ATT TAT TGG 
3361 

AAA ATT ATC CAC CAG TCG ATT CAA ACT GAA 
3421 

TTG TAG GAG TAG GAA ACA TTT TCA ATA AAT 
3481 

AAA TCA TCA TGG TTG GAA ATT TGG GAG GAG 
3541 

ATG GTG GAC TAA TTG CTG TAT AGT TAT TTT 
3601 

CAT GCA TTT ATA GAA GAA TAG ATT CAA AGC 
3661 

CAG TCA GCC AAA AAT CAC AGA TAG TGC TTT 
3721 

CTT TGC TTT TTT TCT TAT GTG ACT CTT GTG 
3781 

AGT TTC TTT TTA TAA AGC AAT AAT ATC TCT 
3841 

CAG CAT ATG TAT ATC AGC TAG AAA ATA TAT 
3901 

TAG GAA AAG GAG GAA CAA AGA CAT TAT TTG 
3961 

GTG ACC TTG ACT GAT AAT AAA GAT GTA ATA 



:xxxxxxxxxxxxxxxxxxx 1 

ATC GAG CGG AAA ATT CCA GAA CTG TAG AAG 
lie glu arg lys lie pro glu leu tyr lys 

2911 

GAC TCC TTC CAC AGA TCT AGT TTC AGG AAA 
asp ser phe his arg ser ser phe arg lys 

2971 

GAA AAG CCA TCT TCA TTC GTG GAG ACT GTG 
glu lys pro ser ser phe val glu thr val 

3031 

TAG TTA AAA AAT GGG ACA TTT GCC ACC CAG 
tyr leu lys asn gly thr phe ala thr gin 

S091 

ACT CTG GAA GCT TTG GGA TCC CAG GAA CCA 



3151 

ATT TTT GCC ATA CTG GGG GGT GGC GGG ATG 
3211 

TTA TTA AAG TGT GTT TTT CCA CAA TGT ACC 
3271 

CTG GCC AAT CAC TGC CCA TCT GAG AGA TGA 
3331 

AGT AAC TCA AAT TGC CTG AGG AAA AAT GGA 
3391 

TTT CAC TCT TTA TAG GAA GGC AGG GCA AAC 
3451 

CTA CAA AGG GAA GCC TTA CTA CAA TTC CAA 
3511 

ATT ATT TGT GAA CTT GTT ACC CTT TTG GTA 

3571 

TGT TTT ATT ATT ACT GTT ACA TTA ATT TAA 
3631 

ACT GAT GTA GGA GAT ACA CGG TAG TTG GAG 
3691 

CAC TTA AAT GGA AAC AAT TCT GCG ATA ATG 
3751 

TAG TAT CTA TTT TTC TCC TCT CTG GGA CCA 
3811 

GTT TTC ATT TCA GAA CAT TGT GCT GTC TGT 
3871 

TCA ACT TTG ACT TCT TTT GAC AAA GGA CTT 
3931 

AGA ATT AAA TTA TAT ATT TTT AAT ATG ACT 
3991 

AGA ATT GCA AGC TAA AAA AAA AAA AAA AAA 
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4021 
AAC TCG 
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LU 
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7.5 kb 
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HC2A 

KIAA ASGNLDKNARFSAIYRQDSNKLSNDDMLKLLADFRKPEKMAKLPVILGNLDITIDNVSSD 

rat 

HC4 — 

HCl 

HC3 

HC5 

HC2A ^ 

KIAA FPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKHTQPYTIYTNHLYVYPKYLKYDSQ 

rat 

HC4 

HCl 

HC3 

HC5 

HC2A VLHHHQNPE F YDE I K 

KIAA KSFAKARNIAICIEFKDSDEEDSQPLKCIYGRPGGPVFTRSAFAAVLHHHQNPEFYDEIK 

rat 

HC4 

HCl 

HC3 

HC5 

HC2A lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLFLLKDGRWTSEQHI 

KIAA lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPLLKDGRVVTSEQHI 

rat 

HC4 

HCl 

HC3 

HC5 

HC2A PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 

KIAA PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 

rat 

HC4 

— — — — — — — 

HC3 GPGPARSTVSISLISNSARV 

HC5 

HC2A QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 

KIAA QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 

rat 

HC4 ME I Q VL I RFL S V I LMQL FW VL PNM IHEDDVPISC PMV 

HCl MSFLPIILNQLFKVLV-QNEEDEITTTVTRV 

HC3 NRSRSLSNSNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 

HC5 
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HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



IIHVVAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSN 
IIHVVAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSN 

LFHIVSKCHEEGLDSYLSSFIKYSFRPGKPSAPQAPLIHETLATMMIALLKQSADFLAIN 

LPDIVAKCHEEQLDHSVQSYIKFVFKTR ACKERPVHEDLAKNVTGLLK-SNDSPTVK 

TQAMDRSCNRMSSHTETSSFLQTLTGRLP TKKLFHEELALQWWCSG— SVR E 

Cadherin 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



KLLRYSWFFFDVLIKSMAQHLIENSKVKLL 
KLLKYSWFFFDVLIKSMAQHLIENSKVKLL 


RNQR 
RNQR 


FPASYHHAAETVVNMLMPHITQKFGD 
FPASYHHAVETVVNMLMPHITQKFRD 


KLLKYSWFFFEI lAKSMATYLLEENKIKLT 
HVLKHSWFFFAIILKSMAQHLIDTNKIQLP 
SALQQAWFFFELMVKSMVHHLYFNDKLEAP 


HGQR 
RPQR 
RKSR 


FPKAYHHALHSLFLAIT-IVESQYAE 
F PE S YQNE LDNL VMVL S DH V I WKYKD 
FPERFMDDIAALVSTIASDIVSRFQK 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS — CFAPGDPKTLFEYKFEFL 

NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS~CFAPGDPKTLFEYKFEFL 

IPKESRNVNYSLASFLKCCLTLMDRGFVFNLIN DYIS — GFSPKDPKVLAEYKFEFL 

ALEETRRATHSVARFLKRCFTFMDRGCVFPCMVN NYIS — MFSSGDLKTLCQYKFDFL 

DTEMVERLNTSLAFFLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVLVSLRLDFL 



HC2A 

KIAA 

rat 

EC 4 

HCl 

HC3 

HC5 



RWCNHEH Y I PLNL PM- 
RWCNHEH Y I P LNL PM- 



-PFGKGRIQR- 
-PFGKGRIQR- 



-YQDLQL DYSLTDEF 

-YQDLQL DYSLTDEF 



QTICNHEHYIPLNLPM AFAKPKLQR VQDSNL EYSLSDEY 

QEVCQHEHFIPLCLPIRSANIPDPLTPSES TQELHASDMPEYSVTNEF 

RIICSHEHYVTLNLPCSLLTPPASPSPSVSSATSQSSGFSTKVQDQKIANMFELS — VPF 
MNADTAPTSPCPSIS SQNSSSCSSFQDQKIASMFDRTSRVPA 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



Cadherin 
EC motif 



CRNHFLVGI 
CRNHFLVGI 


LLRE 
LLRE 


VGTALQEFRE VRLIAISVLKNLLIKHSFDDRYASRSHQARIAT 

VGTALQEFRE VRLIAISVLKNLLIKHSFDDRYASRSHQARIAT 


CKHHFLVGI 
CRKHFLIGI 
RQQHYLAGL 
SSTS-SPGI 


LLRE 
LLRE 
VLTE 
LFTE 


TSIALQDNYE IRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQ 

VGFALQEDQD VRHLALAVLKNLMAKHSFDDRYREPRKQAQIAS 

LAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
LAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAA 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 
LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 

LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP — AN — RGSLS 

LYMPLYGMLLDNMPRIYLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFS 

LYLPLIGIIMETVPQLYDFTETHNQRGRPICIATDDYESE SG SMIS 

LYLPLVGIILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 
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HC2A KDLLGAISGIASPYTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 

KIAA KDLLGAISGIASPYTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 

rat 

HC4 TDKDTAYGSFQNG HGIKREDSRGSLIP-EGATGFPDQGNTGEN TRQS 

HCl KDVLNSIAAFSS lAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPL 

HC3 QTVAMAIAGTSVPQ LTRPGSFLLTSTSGRQHT 

HC5 QNVALAIAGNNFN LKTSG-IVLSSLPYKQYN 

HC2A TLGNSVVRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

KIAA TLGNSWRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

rat 

HC4 STRSSVSQYNRLDQYEIRSLLMCYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 

HCl ALIGSTLRFDRLDQAETRSLLMCFLHIMKTISYETLIAYWQ-RAPSPEVSDFFSILDVCL 

HC3 TFSAESSRSLLICLLWVLKN-ADETVLQKWFTDLSVLQLNRLLDLLYLCV 

HC5 MLNADTTRNLMICFLWIMKN-ADQSLIRKWIADLPSTQLNRILDLLFICV 

HC2A HQFQYMGKRYIARNQEGLG— PIVHDRKS QTLPVSRNRTGMM 

^jp^j^ HQFQ YMGKRY I AR TGMM 

rat 

HC4 FHFRYMGKRNIARVHDAWLSKHFGIDRKS QTMPALRNRSGVM 

HCl QNFRYLGKRNIIRKIAAAF—KFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHK 

HC3 SCFEYKGKKVFERMNSLTFK—KSKDMRAK LEEAILGSIGARQEMV 

HC5 LCFEYKGKQSSDKVSTQVLQ — KSRDVKAR LEEALLRGEGARGEMM 

HC2A HARLQQL GSLDKS LTFNHSYGHSDADVLHQSLLEANIATEVC 

KIAA HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

rat 

HC4 QARLQHL SSLESS FTLNHSSTTTEADIFHQALLEGNTATEVS 

HCl QHRSQTLPIIRGK NALSNPKL LQMLDNTMTSNSNEIDIVHHVDTEANIATEGC 

HC3 RRSRGQLERSPSGSAFGSQENLRWRKDMTHWRQNTEKLDKSRAEIEHEALIDGNLATEAN 

HC5 RRRAPGNDRFP GLNENLRWKKEQTHWRQANEKLDKTKAELDQEALI SGNLATEAH 

HC2A LTALDTLSLFTLAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 

KIAA LTALDTLSLFTLAFKNQLLADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIY 

rat KLSRGHSPLMKKVFDVYLCFLQKHQSEMALKNVFTALRSLIY 

HC4 LTVLDTISFFTQCFKTHFLNNDGHNPLMKKVFDIHLAFLKNGQSEVSLKHVFASLRAFIS 

HCl LTILDLVSLFTQTHQRQLQQCDCQNSLMKRGFDTYMLFFQVNQSATALKHVFASLRLFVC 

HC3 LIILDTLEIWQTVS— VTES—KESILGGVLKVLLHSMACNQSAVYLQHCFATQRALVS 

HC5 LIILDMQENIIQASS~ALDC~KDSLLGGVLRVLVNSLNCDQSTTYLTHCFATLRALIA 

HC2A KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 

KIAA KFPSTFYEGRADMCAALCYEILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 

rat KFPSTFYEGRADMCASLCYEVLKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTH 

HC4 KFPSAFFKGRVNMCAAFCYEVLKCCTSKISSTRNEASALLYLLMRNNFEYTKRKTFLRTH 

HCl KFPSAFFQGPADLCGSFCYEVLKCCNHRSRSTQTEASALLYLFMRKNFSFNKQKSIVRSH 

HC3 KFPELLFEEETEQCADLCLRLLRHCSSSIGTIRSHPSASLYLLMRQNFEIGN — NFARVK 

HC5 KFGDLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFSFGATS — NFARVK 

HC2A LQVIISVSQLIADVVGIGETRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

KIAA LQVIISVSQLIADWGIGGTRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

rat LQVIISLSQLIADVVGIGGTRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLM 

HC4 LQIIIAVSQLIADVALSGGSRFQESLFIINNFANSDRPMLARAFPAEVKDLTKRIRTVLM 

HCl LQLIKAVSQLIAD-AGIGGSRFQHSLAITNNFANGDKQMKNSNFPAEVKDLTKRIRTVLM 

HC3 MQVPMSLSSLVGTSQNFNEEFLRRSLKTILTYAEEDLELRETTFPDQVQDLVFNLHMILS 

HC5 MQVTMSLASLVGRAPDFNEEHLRRSLRTILAYSEEDTAMQMTPFPTQVEELLCNLNSILY 
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TCG ACT ATG AAT 
ser thr met asn 

61 

TCA AGC TCC TGC 
ser ser ser cys 

121 

AGA GTA CCC GCC 
arg val pro ala 

181 

CTG GAT GCC GAA 
leu asp ala glu 

241 

AGC CTG CTA AGT 
ser leu leu ser 

301 

ATC GCC GCC CTT 
lie ala ala leu 

361 

GAC TTT ACA GTT 
asp phe thr val 

421 

GGA GCC GGT GCC 
gly ala gly ala 

481 

AAA ACA AGT GGA 
lys thr ser gly 

541 

GAC ACT ACT CGC 

asp thr thr arg 
601 

CTC ATT AGG AAG 
ieu lie arg lys 

661 

CTT TTC ATC TGT 
leu phe lie cys 

721 

ACC CAA GTC CTG 
thr gin val leu 
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GGG GAA GGG GCC 
gly glu gly ala 



GCT GAT ACT GCT 
ala asp thr ala 

TCC AGC TTC CAG 
ser ser phe gin 

AGC AGC ACT TCC 
ser ser thr ser 



GGG GAA GGA ATC 
gly glu gly lie 

TCT CAC GAC CTG 
ser his asp leu 



TAC CTA CCT TTA 
tyr leu pro leu 

GCA GAT ACT CGC 
ala asp thr arg 

ATT ACC CAG AAT 
lie thr gin asn 

ATA GTG CTG TCT 
lie val leu ser 



AAC CTC ATG ATC 

asn leu met ile 



TGG ATT GCT GAC 
trp ile ala asp 

GTG TTA TGT TTT 
val leu cys phe 



CAG AAG TCA AGG 
gin lys ser arg 

AGA GGG GAG ATG 
arg gly glu met 



31 

CCA ACA TCT CCT 
pro thr ser pro 

91 

GAC CAG AAG ATC 
asp gin lys ile 

151 

TCA CCG GGG CTC 
ser pro gly leu 

211 

AGC GAA GTA CAA 
ser glu val gin 

271 

GAC CCA CGC TGT 
asp pro arg cys 

331 

GTT GGC ATC ATT 
val gly ile ile 

391 

AGA TAC CGC ACC 
arg tyr arg thr 

451 

GTG GCT CTG GCC 

val ala leu ala 

I ref 2.1 

I 511 

TCC TTG CCC TAT 

ser leu pro tyr 

571 

TGC TTC CTC TGG 

cys phe leu trp 
631 

CTG CCA TCA ACG 
leu pro ser thr 

691 

GAG TAT AAG GGA 
glu tyr lys gly 

751 

GAT GTC AAG GCC 
asp val lys ala 

811 

ATG CGC CGC CGG 
met arg arg arg 



TGT CCT TCC ATA 
cys pro ser ile 

GCC AGC ATG TTC 
ala ser met phe 

CTC TTC ACA GAA 
leu phe thr glu 

AGG AAA GCT GTC 
arg lys ala val 

GTC AAA CCA GAG 
val lys pro glu 

TTG GAT GCT TTG 
leu asp ala leu 

AGT GGC TCG GAT 
ser gly ser asp 

ATA GCA GGG AAT 
lie ala gly asn 

AAG CAG TAC AAC 
lys gin tyr asn 

ATC ATG AAA AAT 
ile met lys asn 



CAG CTC AAC AGG 
gin leu asn arg 



AAA CAG AGT TCT 
lys gin ser ser 



CGG CTG GAA GAG 
arg leu glu glu 

GCT CCA GGG AAC 
ala pro gly asn 



ref 

TCT TCC CAG AAC 
ser ser gin asn 

GAT CGG ACT TCC 
asp arg thr ser 

CTG GCT GCT GCC 
leu ala ala ala 



AGT GCA ATT CAC 
ser ala ile his 



GTG AAG GTC AAA 
val lys val lys 

CCA CAG CTC TGT 
pro gin leu cys 

GAA GAA CAA GAA 
glu glu gin glu 

AAT TTC AAT TTG 
asn phe asn leu 

ATG CTG AAC GCG 
met leu asn ala 

GCT GAT CAG AGC 
ala asp gin ser 



ATT TTA GAT CTA 
ile leu asp leu 



GAC AAA GTC AGT 
asp lys val ser 

GCT TTG CTG CGT 
ala leu leu arg 

GAC CGA TTT CCA 
asp arg phe pro 
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841 871 

GGC CTA AAT GAA AAT TTG AGA TGG AAG AAA GAG GAG ACA CAT TGG CGG CAA GOT AAT GAG 
gly leu asn glu asn leu arg trp lys lys glu gin thr his trp arg gin ala asn glu 

901 931 

AAG CTA GAT AAA ACA AAG GCC GAG TTA GAT CAA GAA GCC TTG ATC AGT GGC AAT CTG GCT 
lys leu asp lys thr lys ala glu leu asp gin glu ala leu ile ser gly asn leu ala 

961 991 

ACA GAA GCA CAT TTA ATC ATC CTG GAT ATG CAG GAA AAC ATT ATC CAG GCG AGC TCG GCT 
thr glu ala his leu ile ile leu asp met gin glu asn ile ile gin ala ser ser ala 

1021 1051 

CTG GAC TGT AAA GAC AGC CTG CTG GGA GGT GTT CTG AGG GTG CTG GTG AAT TCT CTG AAC 
leu asp cys lys asp ser leu leu gly gly val leu arg val leu val asn ser leu asn 

I ref 3.1 

1081 1111 f 

TGT GAT CAG AGT ACC ACC TAC CTG ACT CAC TGC TTT GCA ACA CTC CGT GCT CTC ATC GCC 
cys asp gin ser thr thr tyr leu thr his cys phe ala thr leu arg ala leu ile ala 

1141 1171 

AAG TTT GGA GAC TTA CTC TTC GAA GAG GAG GTG GAA CAG TGT TTC GAC CTA TGT CAC CAA 
lys phe gly asp leu leu phe glu glu glu val glu gin cys phe asp leu cys his gin 

1201 1231 

GTC CTG CAC CAC TGC AGC AGC AGC ATG GAT GTC ACC CGG AGC CAA GCC TGT GCC ACC CTT 
val leu his his cys ser ser ser met asp val thr arg ser gin ala cys ala thr leu 

1261 1291 

TAC CTC CTC ATG AGG TTC AGT TTT GGA GCC ACC AGT AAT TTT GCA AGA GTA AAG ATG CAA 
tyr leu leu met arg phe ser phe gly ala thr ser asn phe ala arg val lys met gin 

1321 1351 

GTA ACC ATG TCC CTG GCA TCT TTG GTG GGA AGA GCA CCA GAC TTT AAT GAA GAG CAC CTG 
val thr met ser leu ala ser leu val gly arg ala pro asp phe asn glu glu his leu 

1381 1411 

AGA AGA TCC TTG AGG ACA ATT TTG GCC TAT TCA GAA GAG GAC ACA GCC ATG CAG ATG ACT 
arg arg ser leu arg thr ile leu ala tyr ser glu glu asp thr ala met gin met thr 

1441 1471 

CCT TTT CCC ACC CAG GTG GAG GAA CTT CTC TGT AAT CTG AAT AGC ATC TTA TAT GAC ACA 
pro phe pro thr gin val glu glu leu leu cys asn leu asn ser ile leu tyr asp thr 

1501 1531 

GTG AAA ATG AGG GAA TTT CAG GAA GAT CCT GAG ATG CTT ATG GAT CTC ATG TAC AGA ATT 
val lys met arg glu phe gin glu asp pro glu met leu met asp leu met tyr arg ile 

1561 1591 

GCC AAG AGT TAC CAG GCA TCT CCT GAT CTG CGG CTG ACC TGG CTC CAG AAC ATG GCA GAG 
ala lys ser tyr gin ala ser pro asp leu arg leu thr trp leu gin asn met ala glu 

1621 1651 

AAA CAC ACC AAG AAG AAG TGC TAC ACG GAG GCT GCC ATG TGC CTG GTG CAC GCC GCT GCG 
lys his thr lys lys lys cys tyr thr glu ala ala met cys leu val his ala ala ala 
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1681 

TTA GTG GCT GAG TAT 
leu val ala glu tyr 

1741 

AGC TTC CAG AAT ATT 
ser phe gin asn ile 
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TCA CCT GAC GAG GAT 
ser pro asp glu asp 
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CTC CTG GAG CAG GCC 
leu leu glu gin ala 

1921 

GTC TAG AAG CTG GTC 
val tyr lys leu val 

1981 

ACT CAC AGC AAG CTG 

thr his ser lys leu 

2041 

TTT GGA ACC TAC TTC 
phe gly thr tyr phe 

2101 

GAG TTT GTC TAC AAA 
glu phe val tyr lys 

2161 

GCA TTT TAT GGT CAA 
ala phe tyr gly gin 
■ ref 4 
2221 f 
GTG GAC AAA ACC AAG 
val asp lys thr lys 

2281 

TAC TTT GAT GAG TAT 
tyr phe asp glu tyr 

2341 

CGG AGG TTC ATG TAC 
arg arg phe met tyr 

2401 

GAG CAG TAC AGA AGG 
glu gin tyr arg arg 

2461 

AGG ATC AGC GTC ATC 
arg ile ser val ile 



CTG AGC ATG CTG GAG 
leu ser met leu glu 

TCT TCC AAT GTG CTG 
ser ser asn val leu 



GGG GTG TGC GCA GGC 
gly val cys ala gly 



GCG GAG CTC TTC AGC 
ala glu leu phe ser 

ATC CCC ATC CTA GAA 
ile pro ile leu glu 

CAG AGA GCC TTC GAC 
gin arg ala phe asp 

CGA GTT GGT TTC TTT 
arg val gly phe phe 

GAG CCT GCA ATT ACC 
glu pro ala ile thr 

TGT TTT GGT GCA GAA 
cys phe gly ala glu 
1 

TTG GAT CCT AAC AAG 
leu asp pro asn lys 

GAG ATG AAA GAC AGG 
glu met lys asp arg 



ACC ACC CCG TTC ACC 
thr thr pro phe thr 



AAC ACA GTC CTG ACC 
asn thr val leu thr 



CAG AAG GAG GAG TTT 
gin lys glu glu phe 



1711 

GAC CAC AGC TAC CTG 
asp his ser tyr leu 

1771 

GAG GAG TCT GTG GTC 
glu glu ser val val 

1831 

CAG TAC TTC ACC GAG 
gin tyr phe thr glu 

1891 

ACG GGA GGC TTA TAT 
thr gly gly leu tyr 

1951 

GCG CAT CGA GAA TTC 
ala his arg glu phe 

2011 

AGC ATC GTT AAC AAG 
ser ile val asn lys 

2071 

GGA TCC AAA TTT GGG 
gly ser lys phe gly 

2131 

AAG CTT CCT GAG ATC 
lys leu pro glu ile 

2191 

TTT GTG GAA GTG ATT 
phe val glu val ile 

2251 

GCC TAC ATA CAG ATC 
ala tyr ile gin ile 

2311 

GTC ACA TAC TTT GAG 
val thr tyr phe glu 

2371 

CTG GAG GGG CGG CCT 
leu glu gly arg pro 

2431 

ACT ATG CAC GCC TTC 
thr met his ala phe 

2491 

GTT TTG ACA CCG ATT 
val leu thr pro ile 



CCC GTG GGC AGT GTC 
pro val gly ser val 



TCT GAG GAC ACC CTG 
ser glu asp thr leu 



AGT GGC CTG GTA GGC 
ser gly leu val gly 



GAG ACA GTT AAT GAG 
glu thr val asn glu 



CGG AAG CTG ACA CTC 
arg lys leu thr leu 

GAT CAT AAG AGA ATG 
asp his lys arg met 

GAT TTG GAT GAA CAG 
asp leu asp glu gin 

TCA CAT AGA CTA GAG 
ser his arg leu glu 



AAA GAC TCC ACT CCT 
lys asp ser thr pro 



ACT TTT GTG GAG CCC 
thr phe val glu pro 

AAG AAT TTC AAC CTC 
lys asn phe asn leu 



CGG GGA GAG CTG CAT 
arg gly glu leu his 

CCC TAC ATC AAG ACC 
pro tyr ile lys thr 

GAA GTT GCC ATT GAA 
glu val ala ile glu 
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2521 

GAC ATG AAG AAG AAG ACC CTG GAG TTA GCA 
asp met lys lys lys thr leu gin leu ala 

2581 

AAG ATG CTT GAG ATG GTG CTG CAA GGC TCT 
lys met leu gin met val leu gin gly ser 

2641 

GAA GTA GCC CAA GTG TTT TTG GCT GAA ATT 
glu val ala gin val phe leu ala glu lie 

2701 

AAC AAG TTG AGG TTA TGC TTT AAG GAA TTC 
asn lys leu arg leu cys phe lys glu phe 

2761 

AAC AAG CGT CTC ATG ACG GCA GAC CAG AGG 
asn lys arg leu ile thr ala asp gin arg 

2821 

AAC AAG CTA AAA GAG AAC CTC AGG CCA ATG 
asn lys leu lys glu asn leu arg pro met 

2881 

CCA ATA TTC AGA GTT GAG AGT CAA AAG AGG 
pro ile phe arg val glu ser gin lys arg 

2941 

TGT GAA ACC CAG TTG TCA CAG GGC AGC TAA 
cys glu thr gin leu ser gin gly ser OCH 

■ref 5.1 
3001 f 

GCC CTG CAA CCC TGG AGA AGG ACT TGC TGG 
3061 

GAC TGA CTG TAG ACT CCC TGA TCA GCC AGC 
3121 

TGG AAT TAT TCC CAA ATG GAC TCT GAC CAG 
3181 

GAG GAT GGG TAG TCA GGC ATG ACT GCG TAT 
3241 

AAA CAA GGC ATA AGC AGC TTC TCC TGC TGA 
3301 

TTT CCT CTG GCC CAT ATT TGA ATT TAT TGG 
3361 

AAA ATT ATG CAC CAG TCG ATT CAA ACT GAA 
3421 

TTG TAG GAG TAG GAA ACA TTT TCA ATA AAT 



2551 

GTT GCC ATT AAC CAG GAG CCG CCT GAT GCA 
val ala ile asn gin glu pro pro asp ala 

2611 

GTG GGA GCT ACT GTA AAT CAG GGA CCA CTG 
val gly ala thr val asn gin gly pro leu 

2671 

CCT GCT GAT CCA AAA CTC TAT CGA CAT CAC 
pro ala asp pro lys leu tyr arg his his 

2731 

ATG ATG AGA TGT GGT GAA GCT GTA GAG AAA 
ile met arg cys gly glu ala val glu lys 

2791 

GAA TAT CAG CAG GAA CTC AAA AAG AAC TAT 
glu tyr gin gin glu leu lys lys asn tyr 

2851 

ATG GAG CGG AAA ATT CCA GAA CTG TAG AAG 
ile glu arg lys ile pro glu leu tyr lys 

2911 

GAC TCC TTC CAC AGA TCT AGT TTC AGG AAA 
asp ser phe his arg ser ser phe arg lys 

2971 

GAA AAG CCA TCT TCA TTC GTG GAG ACT GTG 



3031 

TAC TTA AAA AAT GGG ACA TTT GCC ACC CAG 
3091 

ACT CTG GAA GCT TTG GGA TCC CAG GAA CCA 
3151 

ATT TTT GCC ATA CTG GGG GGT GGC GGG ATG 
3211 

TTA TTA AAG TGT GTT TTT CCA CAA TGT ACC 
3271 

CTG GCC AAT CAC TGC CCA TCT GAG AGA TGA 
3331 

AGT AAC TCA AAT TGC CTG AGG AAA AAT GGA 
3391 

TTT CAC TCT TTA TAG GAA GGC AGG GCA AAC 
3451 

CTA CAA AGG GAA GCC TTA CTA CAA TTC CAA 
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3481 3511 

AAA TCA TCA TGG TTG GAA ATT TGG GAG GAG ATT ATT TGT GAA CTT GTT ACC CTT TTG GTA 



3541 3571 

ATG GTG GAC TAA TTG CTG TAT AGT TAT TTT TGT TTT ATT ATT ACT GTT ACA TTA ATT TAA 

3601 3631 

CAT GCA TTT ATA GAA GAA TAG ATT CAA AGC ACT GAT GTA GGA GAT ACA CGG TAC TTG GAG 

3661 3691 

CAG TCA GCC AAA AAT CAC AGA TAC TGC TTT CAC TTA AAT GGA AAC AAT TCT CCG ATA ATG 

3721 3751 

CTT TGC TTT TTT TCT TAT GTC ACT CTT GTG TAC TAT CTA TTT TTC TCC TCT CTG GGA CCA 

3781 3811 

AGT TTC TTT TTA TAA AGC AAT AAT ATC TCT GTT TTC ATT TCA GAA CAT TGT GCT GTC TGT 

3841 3871 

CAG CAT ATG TAT ATC AGC TAC AAA ATA TAT TCA ACT TTG ACT TCT TTT GAC AAA GGA CTT 

3901 3931 

TAG GAA AAG GAG GAA CAA AGA CAT TAT TTG AGA ATT AAA TTA TAT ATT TTT AAT ATG ACT 

3961 3991 

GTG ACC TTG ACT GAT AAT AAA GAT GTA ATA AGA ATT GCA AGC TAA AAA AAA AAA AAA AAA 

4021 
AAC TCG 



Ref 1.1 

Sequence of BAC19 using primer HC5S11, which spans 
nucleotides 3-22 of the cDNA.Exon sequence is underlined 
and represents nucleotides 32-57, 

CTCTCTGTCTTCATATCTTCCAGG TTATAAAGNATTATTTACTAAAAGAANATTCANGC 
TATTTCATTTAACTAGCTCAGTTTAATCATGTATTTCCTATAAAGGTTAGTCTTATTAAT 
TTGACAAAACAATCAAACAATTCAAACCAGATCAAGTATGCTACCCTGAAGTTACACC 
ACTAGCTAAGAATTAACAATCTAAGTAATTGGTTTCTCCCCAGGCTCAAGGCTCCCTGA 
TCAGGTTAAGTAAAGCCAAGAATCCAATAAGCCCTATGAAATTTAGAAACTCATAGAA 
AAGTCTCAAATCTTCTTGTCTGACATTAGCCAATTGTTATATTATGCAAATAGAGGATT 
NCAAGTAAATAAGTTTGGAACCTGTTTACCAGGTTTTTGCAGCAGNCCTCTAAGAGCTT 
AACTGGTCATGCATTGAATGCCGAGAGCAAAGAGGAATGGAGAGGGGNTGTAAGNGG 
TTCCAATNTTACTGGAACCCACCACTATCTTTNGAAGTCTTGATACTTAACTGNGTGTA 
GNCTCTTTAGGCCTNTANTAANTAGAATCTATATGGATTCGTGTTCTGTCNGCAAGNAG 
TGCCTATGAAA 
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Ref 2.1 

Sequence of BAC19 using primer HCSASlOb, which spans 
nucleotides 560-580 of Exon sequence is underlined and 
represents nucleotides 510-553. 

TGCGAGTAGTGTCCGCGTTCAGCATGTTGTACTGCTTATAGGGC TGAAGGGAGGCACG 
ATTGGGGGATGGAGGCCAGGGAAGAAGTCAAGCAACAGAAAAATTTGAGGCTTAACA 
GTCAAGCAACAGAAAAATTCAAAGTGTTCTCTTAAAATACCATGACTGTACATCACTG 
CTAGGCTGGAGATCTATTGCCAGTAGCCCTGCCTTCCCTAGGCAGGGGAAGCTGTGTT 
CTTTGAGTAGCGCTACTCAGCAAAGAGGCTCACCTGGGGCAGTATTTGAGCTAGGCTT 
TCAGCCACCGTATCTGAGTACCTCTGTCTTANGAGCAGTGTGGCCTGGTGATCACCCCT 
GGGCCTTGATCATGCGTGCTGCAATCCCAGTGATACAAAGAGGCTTTCATGCTGCTAA 
GATCTCCAAGTATTTCTCCTTCGTGCTGGGCAGCAGAGGGTTAGACTTNCAGGGGAGA 
AGGAAACTGGCTGGGTGCCATGAATAANCTTGCTGTTCAAGANTTAACTTCTTTGTTAC 
ATAAGNGCAAAGGTATAACATAAAGGGNCATGAACTGCTCAACNAAATTNATCAAAT 
CCATGTTTGTGGGAGTTCTTTTGTNATNGGAAGTTTAACCCCTAA 

Ref 3 . 1 

Sequence of BAC13 using primer C5S3, which spans 

nucleotides 1086-1105 of the cDNA. Exon sequence is 

underlined and represents nucleotides 1110-1120. 

CCCNGCTCTTTTTGGCAA NGTAANCTTGGGATGCTTGTTTTCTTCCTCTTAATTAAGAG 

NAAGANTTTTTTTAGCTTCATACTTCTCTCTTCAGGGGGACCAAAAGTCACAGAGCATA 

TTAAGTGGCANAACCCCNAAGGTCTTAAGTCTTCCTAGGAAGAAAGCAGATGCCCTGA 

TTCTGTGGGAAGCCACCATGGAGAGGAAAAGCAGTGGCTCCCATATTTGAAGTGNGGA 

CCTAACTCTAGAAAGTTTAAAANGGCCATTTGCTGAAGGGCTATGACATGAGAACAGA 

GATCAACTGAGTGACTTAGCAANTTCACTCTTTCTCTGTAANACCTCTGGTGAGTGAGA 

NTAAATCCTNTATGTGACGCCCATTAGTCTTACAAAANGTCATGCCNTAAAATGCCAN 

GAAGGNCAGAAATGAATTTCTCACCGCCNGAGGAATGAGGATTATNCTGGGGGGACA 

TGCANAAATATTNNNCCCCCNATTTATTNATTTATTTATTTTTGAGACNGAGTNTCGNT 

CTAATCGCCCCCAGGCTGGNAGGTGGNAGGTGGTTCCCATCTTNAANCTTANNTNGGA 

AGGNCCTCTTTGNGCCCCNGGGGGGNGNAAAGNGAATTCCCTAAATGCCTNCANNCCCC 

TCCCTGGANGTTATTTGGGGGNNTTNTAAAGGGCNGTGGCNG 

Ref 4,1 

Sequence of BAC13 using primer C5S7, which spans 

nucleotides 2196-2205 of the cDNA. . Exon sequence is 

underlined and represents nucleotides 2225-2231. 

ACAAAAAC TAACCATCANTCTCTAAATCCCAACAANCTTTTTTTAAGAATACCTAANG 

AGCTCAACNAGGGGGACTNTCCAANGCACTTAAATGCAGNCAAACNACNCCNNCAAG 

AGNGGCAACTACTAATGGGGCANATCTNAAAGAAAATATAGNCAAAGGNNGGAATCA 

TAATAGGAGCNACCACTTANGAAGCACCAACTGGGGACCTGGAACTGNATNAGGNCC 

TCTACATACATCATNNNATANCATCCTGCAACGACCCCTGGAAGGAGANAGANGGNA 

TTCCTANNNTAGAGANGAGANAACTGGGGACATGGGAAGAGGNAAGCGAAGGGTTCA 

AGGGGANGNAAGCGAGCAGANNCCAGGGNCTCANACTNGNGGGGNNTGGGGGGNTN 

CTGNNNCCCTACNCTTNGNANGAACAGNGNNGTTGANNTGGCTTTNGANTA 

Ref 5.1 

Sequence of BAC13 using primer 122047F1, which spans 
nucleotides 3537-3556 of the cDNA. Exon sequence is 
underlined and represents nucleotides 3000-3492. 
This region does not contain a intron in this region 
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CCANNAGATTNTTGNAACGNNGGTAGGCTTCCTTTGTAGATTTATTGAAAATGTTTCGT 
ACTTCTACAAGTTTGCCCTGCCTTCCTATAAAGAGTGAAANTCANTTTGAATCGACTGG 
TGGATAATTNTTCCATTTTTCCTCAGGCAATTTNGAGTTACTCCAATAAATTCAAATAT 
GGGCCAGAGGAAATCATCTTTCAGATGGGCAGTGATTGGCCAGTCAGCAGGAGAAGC 
TGCTTATGCCTTGTTTGGTACATTGTGGAAAAACACACTTTAATAAATACGCANTCATG 
CCTGAGTACCCATCCTCCATCCCGCCACCCCCCAGTATGGCAAAAATCTGGTCAGAGT 
CCATTTGGGAATAATTCCATGGTTCCTGGGATCCCAAAGCTTCCAGAAGTGCTGGCTG 
ATCAANGGAGTGTACAGTCAGTCCTGGGTGGCAAAAATGTCCCATTTTTTAAGTACCA 
AGCAAAGGTTCCTTCTTNCAAGGGTTNCTAGGGCC 
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HC2A 

KIAA ASGNLDKNARFSAIYRQDSNKLSNDDMLKLLADFRKPEKMAKLPVILGNLDITIDNVSSD 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA FPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKHTQPYTIYTNHLYVYPKYLKYDSQ 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A VLHHHQNPEFYDEIK 

KIAA KSFAKARNIAICIEFKDSDEEDSQPLKCIYGRPGGPVFTRSAFAAVLHHHQNPEFYDEIK 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPLLKDGRWTSEQHI 

KIAA lELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDVVETQVGYSWLPLLKDGRWTSEQHI 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 

KIAA PVSANLPSGYLGYQELGMGRHYGPEIKWVDGGKPLLKISTHLVSTVYTQDQHLHNFFQYC 

rat 

HC4 

HCl 

HC3 GPGPARSTVSISLISNSARV 

HC5 



HC2A QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 

KIAA QKTESGAQALGNELVKYLKSLHAMEGHVMIAFLPTILNQLFRVLT-RATQEEVAVNVTRV 

rat 

HC4 MEIQVLIRFLSVILMQLFWVLPNMIHEDDVPISCPMV 

HCl MS FLPII LNQLFKVLV-QNEEDE I TTTVTRV 

HC3 NRSRSLSNSNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAES 

HC5 
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Refs 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



IIHVVAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSN 
IIHVVAQCHEEGLESHLRSYVKYAYKAEPYVASEYKTVHEELTKSMTTILKPSADFLTSN 

LFHIVSKCHEEGLDSYLSSFIKYSFRPGKPSAPQAPLIHETLATMMIALLKQSADFLAIN 

LPDIVAKCHEEQLDHSVQSYIKFVFKTR ACKERPVHEDLAKNVTGLLK-SNDSPTVK 

TQAMDRSCNRMSSHTETSSFLQTLTGRLP TKKLFHEELALQWWCSG — SVR E 

Cadherin 
Cl eava ge 



KLLRYSWFFFDVLIKSMAQHLIENSKVKLLRNQPFPASYHHAAETWNMLMPHITQKFGD 
KLLKYSWFFFDVLIKSMAQHLIENSKVKLLRNQFFPASYHHAVETWNMLMPHITQKFRD 



KLLKYSWFFFEIIAKSMATYLLEENKIKLTHGQRFPKAYHHALHSLFLAIT-IVESQYAE 
HVLK|HSWFFFAIILKSMAQHLIDTNKIQLPRPQPFPESYQNELDNLVMVLSDHVIWKYKD 
SALQQAWFFFELM^KSMVHHLYFNDKLEAPRKSPFPERFMDDIAALVSTIASDIVSRFQK 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS — CFAPGDE^KTLFEYKFEFL 

NPEASKNANHSLAVFIKRCFTFMDRGFVFKQIN NYIS — CFAPGDPKTLFEYKFEFL 

IPKESRNVNYSLASFLKCCLTLMDRGFVFNLIN DYIS — GFSPKDPKVLAEYKFEFL 

ALEETRRATHSVARFLkIrCFTFMDRGCVFKMVN NYIS — MFSSGDLKTLCQYKFDFL 

DTEMVERLNTSLAFFLNDLLSVMDRGFVFSLIKSCYKQVSSKLYSLPNPSVIf^SLRLDFL 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



RWCNHEHYIPLNLPM- 
RVVCNHEHYIPLNLPM- 



-PFGKGRIQR- 
-PFGKGRIQR- 



-YQDLQL — 
-YQDLQL— 



-DYSLTDEF 
-DYSLTDEF 



QTICNHEHYIPLNLPM AFAKPKLQR VQDSNL EYSLSDEY 

QEVCQHEHFIPLCLPIRSANIPDPLTPSES TQELHASDMPEYSVTNEF 

RIICSHEHYVTLNLPCSLLTPPASPSPSVSSAT|SQSSGFSTNVQDQKIANMFELS — VPF 
MNADTAPTSPCPSIS SQtbTSSSCSSFQDQKIASMFDRTSRVPA 



1.1 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



Cadherin 
EC motif 



CRNHFLVGI 
CRNHFLVGI 


LLRE 
LLRE 


\/GTALQEFRE VRLIAISVLKNLLIKHSFDDRYASRjSHQARIAT 

\^GTALQEFRE VRLIAISVLKNLLIKHSFDDRYASRSHQARIAT 


CKHHFLVGI 
CRKHFLtlGI 
RQQHYLAGI 
SSTS-SPGI 


LLRE 
LLRE 
V^LTE 
LFTE 


TSIALQDNYE IRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQ 

VGFALQEDQD VRHLALAVLKNLMAKHSFDDRYREPRKQAQIAS 

LAVILDPDAEGLFGLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAM 
LAAALDAEGEGISEVQRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAA 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 
LYLPLFGLLIENVQRINVRDVSPFPVNAG-MTVKDESLALPAVNPLVTPQKGSTLDNSLH 

LYLPFVGLLLENIQRLAGRDTLYSCAAMPNSASRDEFPCG FTSP — AN — RGSLS 

LYMPLYGMLLDNMPRIYLKDLYPFTVNTSNbGSRDDLSTNGGFQSQTAIKHANSVDTSFS 

LYLPLIGIIMETVPQLYDFTETHNQRGRPICIATDDYESE SG SMIS 

LYLPLVGIILDALPQLCDFTVADTRRYR TSGSDEEQE GA GAIT 
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2.1 



Refs 

HC2A KDLLGAISGljASPyTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDP^lHQQSS 

j^j^A KDLLGAISGIASPYTTSTPNINSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSS 

rat 

HC4 TDKDTAYGSFQNG HGIKREDSRGSLIP-EGATGFPDQGWTGEN TRQS 

Hcl KDVLNSI^IaFSS lAISTVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPL 

HC3 QTVAMAIAGTSVPQ LTRPGSFLLTSfcSGRQHT 

HC5 QNVALAIAGNNFN LKTSG-IVLSs|lPYKQYN 

HC2A TLGNSWRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

KiAA TLGNSWRCDKLDQSEIKSLLMCFLYILKSMSDDALFTYWN-KASTSELMDFFTISEVCL 

rat 

HC4 STRSSVSQYNRLDQYEIRSLLMCYLYIVKMISEDTLLTYWN-KVSPQELINILILLEVCL 

Hci aligstlrfdrldqaetrsllmcflhimktisyetliaywq-rapspevsdffsiiJdvcl 

HC3 tfsaessrsllicllwvlkn-adetvlqkwftdlsvlqlnrlldllylcv 

HC5 mlnadttrnlmicflwimkn-adqslirkwiadlpstqlnrildllficv 

HC2A HQfIqYMGKRYIARNQEGLG— PIVHDRKS QTLPVSRNRTGMM 

K I AA HQ FQ YMGKRY I AR TGVM 

rat 

HC4 FHFRYMGKRNIARVHDAWLSKHFGIDRKS QTMPALRNRSGVM 

Hci QNFRYLGKRNIIRKIAAAF— KFVQSTQNNGTLKGSNPSCQTSGLLAQWMHSTSRHEGHK 

HC3 SCFEYKGKKVFERMNSLTFK— KSKDMRAK LEEAILGSIGARQEMV 

HC5 LCFEYKGKQSSDKVSTQVLQ— KSRDVKAR LEEALLRGEGARGEMM 

^^2A HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

HARLQQL GSLDNS LTFNHSYGHSDADVLHQSLLEANIATEVC 

rat 

QARLQHL SSLESS FTLNHSSTTTEADIFHQALLEGNTATEVS 

fiCl QHRSQTLPIIRGK— NALSNPKL LQMLDNTMT|sNSNEIDIVHHVDTEANIATEGC 

Hc 3 rrsrgqi|ers P S GS AFG S QENLRWRKDMTHWRQNTEKLDKSRAE IEHE AL I DGNL ATEAN 

jiC5 RRRAPGNDRFP GLNENLRWKKEQTHWRQANEKLDKTKAELDQEALISGNLATEAH 

HC2A ltaldtlslftlafkInqlladhghnplmkkvfdvylcflqkhqsetalknvftalrsliy 

KiAA ltaldtlslftlafknqlladhghnplmkkvfdvylcflqkhqsetalknvftalrsliy 

klsrghsplmkkvfdvylcflqkhqsemalknvftalrsliy 

j^C4 ltvldtisfftqcfkthflnndghnplmkkvfdihlaflkngqsevslkhvfaslrafis 

j^ci ltildlvslftqthqrqlqqcdcqnslmkrgfdtymlffqvnqsatalkhvfaslrlfvc 

HC3 liildtleivvqtvs— vtes— kesilggvlkvllhsmacnqsavylqhcfatqralvs 

HC5 liildmqeniiqass— aldc—kdsllggvlrvlvnslncdqsttylthcf^tlralia 3.1 



HC2A kfpstfyegradmcaalcyeilkccnsklssirteasqllyflmrnnfdytgkksfvrth 

KiAA kfpstfyegradmcaalcyeilkccnsklssirteasqllyflmrnnfdytgkksfvrth 

j-at kfpstfyegradmcaslcyevlkccnsklssirteasqllyflmrnnfdytgkksfvrth 

HC4 kfpsaffkgrvnmcaafcyevlkcctskisstrneasallyllmrnnfeytkrktflrth 

HCI kfpsaffqgpadlcgsfcyevlkccnhrsrstqteasallylfmrknfefnkqksivrsh 

HC3 k^peLLFEEETEQCADLCLRLLRHCSSSIGTIRSHPSASLYLLMRQNFEIGN— NFARVK 

HC5 kFGDLLFEEEVEQCFDLCHQVLHHCSSSMDVTRSQACATLYLLMRFSFGATS— NFARVK 

HC2A lqviisvsqliadvvgigetrfqqslsiinncansdrlikhtsfssdvkdltkrirtvlm 

KiAA lqviisvsqliadvvgiggtrfqqslsiinncansdrlikhtsfssdvkdltkrirtvlm 

rat lqviislsqliadwgiggtrfqqslsiinncansdrlikhtsfssdvkdltkrirtvlm 

HC4 lqiiiavsqliadvalsggsrfqeslfiinnfansdrpmlarafpaevkdltkrirtvlm 

HCI lc^likavsqliad-agiggsrfqhslaitnnfangdkqmkn^nfpaevkdltkrirtvlm 

HC3 mqvpmslsslvgtsqnfneeflrrslktiltyaeedlelrettfpdqvqdlvfnlhmils 

HC 5 MQVTMS las L VGRAP DFNEEHLRRS LRT I LAYSEE DT AMQMT PFPTQVEELLCNLNS I L Y 
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HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 

SH3 

HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 

IT AM 

HC2A 

KIAA 

rat 

HC4 

HCl 

HC3 

HC5 



Transmembrane 



Refs 



ataqmkehendpemlvdlqyslaksyastpelrktwldsmarihvkngdlseaamcyvhv 
ataqmkehendpemlvdlqyslaksyastpelrktwldsmarihvkngjolseaamcyvhv 
ataqmkehendpemlvdlqyslaksyastpelrktwldsmari 
at aqmkehekdpeml i dlq y s l aks 
ataqmkehekdpemlvdlqyslansyastpelrrtwlesmakiharnGdlseIaamcyihi 
dt vkmkehqe dpeml i dlmyri akg yqt s pdlrltwlqnmagkhsers sthae aaqclvhs 
dtvkmrefoedpemlmdlmyriaksyqaspdlrltwlqnmaekhtkkk cyteaamclvha 



HVKNG DLSEAAMCYVHV 
3 Y AS T PE LRKTWLD S MAK I HVKNG DF S E AAMC Y VHV 



AALVAEYISMLED- 
AALVAEYISMLED- 



-frqgctafrvitpn 
■vqweppllphshsaclrrIsrggvfrqgctafrvitpn 
-lalqreppvfpyshtscqrkjsrggmfrqgctafrvitpn 

-FPNGCSAFKKITPN 
5ITPN 

-RKYLPVGCVTF^ISSN 
-HSYLPVGSVSFQNISSN 



TALVAEYITRKGV 

TALVAEYITRKEA 

TALVAEYITRKEAD 

AALVAEFIHRKKL 

AALIAEYIKRKGYWKVEKldTASLLSEDTHPCDSNSLLTTEjSGGSMFSMGWPAFLS 



IT AM 

IDEEASMMEDVG|MQD- 



-VHFNEDVLMELLEQCADGLWKAERYELIADIYKLI 



IDEEASMMEDVGMQD VHFNEDVLMELLE0CADGLWKAER |YELIADIYKLI| IPI 



IPI 



IDEEASMMEDVGMQD VHFNEDVLMELLEQCADGLWKAE RLRAG LLTSINSSSP 

IDEEGAMKEDAGMMD VHYSEEVLLELLEQCVNGLWKAER|YEII|SEISKLIGPI 

IKEEGAAKEDSGMHD TPYNEjMILVEQLYMCGEFLWKSER lyELll ADVNKPIIAV 

VLEESAVSDDVVSPDEEGICSGKYFTESGLVGLLEQAAASFSMAGMYEAVNEVYKVLIPI 
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SHLFPYVKKRIQVISQSSTELNPIEVAIDEMSRKVSELNQLCTMEEVDMISLQLKLQG SV 
SHAFPYIKTRVNVTHKEEIILTPIEVAIEDMQKKTQELAFATHQDPADPKMLQMVLQG SV 
MHAFPYIKTRISVIQKEEFVLTPIEVAIEDMKKKTLQLAVAINQEPPDAKMLQMVLQG SV 
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-Ill 

CGGTAACCGCCATTTTGTCTCCTGTAACAATTTACGCGCCGTGTAACTGTGAATCTTTCAAAGCCTCAGTTTTATGACC 
CTGTGGAGCCAGTGGACTTTGAAGGACTTCTG - 1 



1/1 

ATG ACA CAC CTG AAC 
Met thr his leu asn 
61/21 

GAG GAC TTG GAG GTG 
asp asp leu asp val 
121/41 

GAG GAA GGG GTT GAA 
glu glu gly val glu 
181/61 

TGG CTA ATC GTG AAC 
trp leu ile val asn 
241/81 

GGA TOT CGA AAA GAT 
gly ser arg lys asp 
301/101 

GAG TGC AGT GAA CCC 
glu cys ser glu pro 
361/121 

TCT GGG AAA GGC CCC 
ser gly lys gly pro 
421/141 

CGG CTA GAA AAC CTC 
arg leu glu asn leu 
481/161 

GCC CGG AGG ACC AAC 
ala arg arg thr asn 
541/181 

GAT GCT GTG GAA ATA 
asp ala val glu ile 
601/201 

TTG GTC AAG TTG CTG 
leu val lys leu leu 
661/221 

GCC CTC TAG GAT GTT 
ala leu tyr asp val 
721/241 

TCT GAC CAG TTC AAA 
ser asp gin phe lys 
781/261 

GCG AGA TCT GCA GTC 
ala arg ser ala val 
841/281 

ATT GAA AAA GTC CTG 
ile glu lys val leu 
901/301 

AAA GAA AGT GAT GGT 
lys glu ser asp gly 
961/321 

TCC TTC TGC CAG CGT 
ser phe cys gin arg 



AGC CTG GAT GTG CAG 
ser leu asp val gin 

GTG TTC ACG CCA AAG 
val phe thr pro lys 

CTG GAC CCT CAT GTC 
leu asp pro his val 

CGG AAA AAC CAA GGA 
arg lys asn gin gly 

TTT CAC AAG ACG CTT 
phe his lys thr leu 

GCT GCT CAG GCA GGC 
ala ala gin ala gly 

GTC ACT GCC TGT GAC 
val thr ala cys asp 

CTG CAG CAA GTG AGT 
leu gin gin val ser 

AGG CAG GCC GAG CTC 
arg gin ala glu leu 

CGT CCA GTA CCA GAA 
arg pro val pro glu 

ACC TTG AAG TTC GAG 
thr leu lys phe glu 

AAA GAA AGG AAA AAG 
lys glu arg lys lys 

GGA TTT CTG CGA GCT 
gly phe leu arg ala 

TTC TCA GTC ACC TAC 
phe ser val thr tyr 

CAG CAG GGA GAT ATT 
gin gin gly asp ile 

GGA AAG AGT AAA GAA 
gly lys ser lys glu 

TTG GGG AAA TAC CGG 
leu gly lys tyr arg 



31/11 

CTT GCC CAG GAG CTC 
leu ala gin glu leu 
91/31 

GAA TGT AGG ACT TTG 
glu cys arg thr leu 
151/51 

AGG GAC TGT GTT CAG 
arg asp cys val gin 
211/71 

AGT CCA GAA ATC TGT 
ser pro glu ile cys 
271/91 

CCG AAA CAG ACG TTT 
pro lys gin thr phe 
331/111 

CCC CGC CAC TTA AAC 
pro arg his leu asn 
391/131 

TTT GAC CTC CGC AGC 
phe asp leu arg ser 
451/151 

GCC GAG GAC TTT GAG 
ala glu asp phe glu 
511/171 

TTT GCC CTT TAC CCA 
phe ala leu tyr pro 
571/191 

TGT CCC AAG GAA CAC 
cys pro lys glu his 
631/211 

ATT GAA ATT GAG CCC 
ile glu ile glu pro 
691/231 

ATC TCA GAA AAT TTT 
ile ser glu asn phe 
751/251 

CAC ACG CCT TCA GTG 
his thr pro ser val 
811/271 

CCG TCC TCA GAC ATC 
pro ser ser asp ile 
871/291 

GGA GAC TGT GCA GAG 
gly asp cys ala glu 
931/311 

AAG ATT GAA AAA CTA 
lys ile glu lys leu 
991/331 

ATG CCC TTT GCC TGG 
met pro phe ala trp 



GGG GAC TTC ACT GAT 
gly asp phe thr asp 

CAG CCC TCT TTG CCG 
gin pro ser leu pro 

ACC TAC ATC CGT GAG 
thr tyr ile arg glu 

GGC TTT AAA AAG ACT 
gly phe lys lys thr 

GAG TCG GAA ACC TTG 
glu ser glu thr leu 

GTG CTG TGC GAC GTG 
val leu cys asp val 

CTG CAG CCT GAC AAG 
leu gin pro asp lys 

AAG CAG AAC GAG GAG 
lys gin asn glu glu 

TCA GTG GAC GAG GAG 
ser val asp glu glu 

CTG GGC AAC AGA ATA 
leu gly asn arg ile 

CTG TTT GCC AGC ATT 
leu phe ala ser ile 

CAC TGT GAC CTG AAC 
his cys asp leu asn 

GCC GCA TCA AGT CAG 
ala ala ser ser gin 

TAC CTG GTA GTC AAG 
tyr leu val val lys 

CCC TAC ACG GTT ATC 
pro tyr thr val ile 

AAA CTC CAA GCT GAA 
lys leu gin ala glu 

GCA CCC ATA AGC TTA 
ala pro ile ser leu 
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1021/341 

TCA AGC TTC TTC AAT 
ser ser phe phe asn 
1081/361 

GGG AGA AGC CCA GTG 
gly arg ser pro val 
1141/381 

GCC CTC TCC TTG GAG 
ala leu ser leu glu 
1201/401 

AGC AGC TTT TTC AAG 
ser ser phe phe lys 
1261/421 

GCT GAG TAC AAA AGA 
ala asp tyr lys arg 
1321/441 

AGA CTG GAG ATT TCT 
arg leu glu lie ser 
1381/461 

CCC GTG AAA CCC TTT 
pro val lys pro phe 
1441/481 

AC A CGA GAA GTA TAT 
thr arg glu val tyr 
1501/501 

AGG CTG AAC TTT GTA 
arg leu asn phe val 
1561/521 

ATG TGT GGA GAA GAT 
met cys gly glu asp 
1621/541 

GAA TTT CTG CAG GAA 
glu phe leu gin glu 
1681/561 

GAA GAA GTG AAA ATT 
glu glu val lys lie 
1741/581 

TTC TAC CAT ATC AGC 
phe tyr his lie ser 
1801/601 

TCA TGG CTG CCA ATT 
ser trp leu pro lie 
1861/621 

GCC TTG GAA AAA TTG 
ala leu glu lys leu 
1921/641 

AAT CCT CCC ATT AAG 
asn pro pro lie lys 
1981/661 

GTT TCT TCT GTA CAC 
val ser ser val his 
2041/681 

CTG GAG AGC CAG GTG 
leu glu ser gin val 
2101/701 

CTG GAG CAT GAG CTG 
leu glu his glu leu 



GTC TCC ACC CTT GAG 
val ser thr leu glu 

GGT GAA CGG AGG ACA 
gly glu arg arg thr 

GAA AAT GGG GTT GGA 
glu asn gly val gly 

CAG GAA GGA GAT CGC 
gin glu gly asp arg 

TCA TCA TCC TTA CAG 
ser ser ser leu gin 

ACA GCT CCA GAG ATC 
thr ala pro glu ile 

CCT GAA AAC CGG ACA 
pro glu asn arg thr 

GTC CCT CAC ACT GTG 
val pro his thr val 

AAC AAA CTA GCA TCA 
asn lys leu ala ser 

GCT AGC AAT GCG ATG 
ala ser asn ala met 

GTG TAC ACA GCT GTT 
val tyr thr ala val 

AAG CTC CCC GCT AAG 
lys leu pro ala lys 

TGT CAG CAG AAG CAA 
cys gin gin lys gin 

CTC TTA AAT GAA CGT 
leu leu asn glu arg 

CCA CCC AAC TAC TCC 
pro pro asn tyr ser 

TGG GCT GAA GGA CAT 
trp ala glu gly his 

ACC CAG GAC AAC CAC 
thr gin asp asn his 

ACC TTC CCC ATC CGC 
thr phe pro ile arg 

AAG CTC AGC ATC ATC 
lys leu ser ile ile 



1051/351 

AGG GAG GTA ACT GAT 
arg glu val thr asp 
1111/371 

TTG GCC CAA TCT AGA 
leu ala gin ser arg 
1171/391 

TCC AAC TTC AAA ACC 
ser asn phe lys thr 
1231/411 

CTT AGC GAT GAA GAC 
leu ser asp glu asp 
1291/431 

AGA CGA GTC AAG TCA 
arg arg val lys ser 
1351/451 

ATC AAT TGC TGT CTG 
ile asn cys cys leu 
1411/471 

CGC CCG CAC AAA GAG 
arg pro his lys glu 
1471/491 

TAC AGA AAC CTT CTC 
tyr arg asn leu leu 
1531/511 

GCC CGG AAC ATT ACA 
ala arg asn ile thr 
1591/531 

CCG GTC ATC TTT GGA 
pro val ile phe gly 
1651/551 

ACA TAC CAT AAT AAG 
thr tyr his asn lys 
1711/571 

CTC ACA GTA AAT CAC 
leu thr val asn his 
1771/591 

GGA GCC TCC GTG GAA 
gly ala ser val glu 
1831/611 

CTT CAA ACT GGA TCC 
leu gin thr gly ser 
1891/631 

ATG CAT TCT GCT GAG 
met his ser ala glu 
1951/651 

AAG GGA GTA TTT AAT 
lys gly val phe asn 
2011/671 

CTG GAG AAG TTC TTC 
leu glu lys phe phe 
2071/691 

GTG CTG GAT CAG AAA 
val leu asp gin lys 
2131/711 

TGC CTG AAC TCC TCC 
cys leu asn ser ser 



GTG GAC TCT GTG GTT 
val asp ser val val 

AGG CTT TCT GAA AGA 
arg leu ser glu arg 

TCC ACT CTG AGC GTT 
ser thr leu ser val 

TTA TTC AAG TTT TTA 
leu phe lys phe leu 

ATT CCA GGC TTG CTA 
ile pro gly leu leu 

ACT CCT GAA ATG CTG 
thr pro glu met leu 

ATT TTG GAA TTT CCA 
ile leu glu phe pro 

TAT GTC TAC CCA CAG 
tyr val tyr pro gin 

ATA AAG ATC CAG TTT 
ile lys ile gin phe 

AAA TCC AGC GGG CCT 
lys ser ser gly pro 

TCT CCT GAC TTT TAT 
ser pro asp phe tyr 

CAC CTC CTG TTC ACC 
his leu leu phe thr 

ACT CTC CTG GGA TAT 
thr leu leu gly tyr 

TAC TGT CTC CCA GTT 
tyr cys leu pro val 

AAA GTC CCA TTA CAG 
lys val pro leu gin 

ATT GAA GTG CAA GCT 
ile glu val gin ala 

ACC CTC TGC CAC TCC 
thr leu cys his ser 

ATC AGC GAG ATG GCG 
ile ser glu met ala 

CGC CTG GAG CCG CTC 
arg leu glu pro leu 
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2161/721 

GTG CTC TTC CTG CAC CTG 
val leu phe leu his leu 
2221/741 

ATC GCT GGC GAG ACA GCC 
lie ala gly gin thr ala 
2281/761 

AAC AGT CTG CAC AAC AGC 
asn ser leu his asn ser 
2341/781 

GCT TCC TAG GTG CAC TAG 
ala ser tyr val his tyr 
2401/801 

GGC GCT CCC ACT GCC CTC 
gly ala pro thr ala leu 
2461/821 

GCT GCT GTG AGT TCA AAG 
ala ala val ser ser lys 
2521/841 

GCG GGG ACA CAC TCC GCA 
ala gly thr his ser ala 
2581/861 

GAT CGC AAC TGC AGC CGA 
asp arg asn cys ser arg 
2641/881 

CCT GCA GCC CCA AGG CCA 
pro ala ala pro arg pro 
2701/901 

GTG GTC AGC ACC GGA ATG 
val val ser thr gly met 
2761/921 

GAC AGT TTT CGG AGG ACT 
asp ser phe arg arg thr 
2821/941 

AAT GTG GTC ACC TCG GAA 
asn val val thr ser glu 
2881/961 

GCG GAA AAG ATG AAC ATC 
ala glu lys met asn ile 
2941/981 

CGG GGC TTT GTG TTT AAC 
arg gly phe val phe asn 
3001/1001 

AAC CTT CCA ACG CTC ATT 
asn leu pro thr leu ile 
3061/1021 

CAT TAG CTC AAT CTG AAC 
his tyr leu asn leu asn 
3121/1041 

CCT TCC ATA TCT TCC CAG 
pro ser ile ser ser gin 
3181/1061 

AGC ATG TTC GAT CTG ACT 
ser met phe asp leu thr 
3241/1081 

ACA GAA CTG GCT GCT GCC 
thr glu leu ala ala ala 



2191/731 
GTG CTG GAC AAG CTC TTC CAG 
val leu asp lys leu phe gin 

2251/751 
AAC TTC TCC CAG TTT GCC TTC 
asn phe ser gin phe ala phe 

2311/771 
AAG GAC CTG AGC AAG GAC CAG 
lys asp leu ser lys asp gin 

2371/791 
GTC TTC CGC CTG CCA GAG GTG 
val phe arg leu pro glu val 

2431/811 
CTA GAC CCT CGG AGC TAG CAC 
leu asp pro arg ser tyr his 

2491/831 
CTG CTG CAG GCC CGG GTG ATG 
leu leu gin ala arg val met 

2551/851 
GCA GAC GAG GAA GTG AAG AAC 
ala asp glu glu val lys asn 

2611/871 
ATG TCT TAG TAT TGC TCT GGC 
met ser tyr tyr cys ser gly 

2671/891 
GCC AGC AAA AAG CAT TTC CAT 
ala ser lys lys his phe his 

2731/911 
GTG AAA AGC ATG GCC CAG CAC 
val lys ser met ala gin his 

2791/931 
CGT TTT TCT GAC CGT TTC ATG 
arg phe ser asp arg phe met 

2851/951 
ATT GCA GCC CTT TTA GTA AAA 
ile ala ala leu leu val lys 

2911/971 
AGC CTG GCT TTC TTC TTG TAT 
ser leu ala phe phe leu tyr 

2971/991 
CTC ATC AGA CAT TAT TGC AGC 
leu ile arg his tyr cys ser 

3031/1011 
TCC ATG AGG CTA GAG TTC CTG 
ser met arg leu glu phe leu 

3091/1031 
CTT TTT TTT ATG AAT GCT GAT 
leu phe phe met asn ala asp 

3151/1051 
AAC TCA AGC TCC TGC TCC AGC 
asn ser ser ser cys ser ser 

3211/1071 
TCC GAG TAG CGC CAG CAG CAC 
ser glu tyr arg gin gin his 

3271/1091 
CTG GAT GCC GAA GGG GAA GGA 
leu asp ala glu gly glu gly 



CTG TCC GTG CAG CCC ATG GTC 
leu ser val gin pro met val 

GAG TCC GTG GTG GCC ATC GCC 
glu ser val val ala ile ala 

CAT GGG AGG AAC TGC CTG CTG 
his gly arg asn cys leu leu 

CAA AGG GAT GTG CCC AAG TCA 
gin arg asp val pro lys ser 

ACG TAT GGC CGC ACA TCA GCT 
thr tyr gly arg thr ser ala 

AGC AGC AGT AAC CCA GAC CTC 
ser ser ser asn pro asp leu 

ATC ATG TCT TCA AAG ATC GCC 
ile met ser ser lys ile ala 

AGT AGT GAT GCT CCA AGT TCA 
ser ser asp ala pro ser ser 

GAG GAG CTT GCC CTT CAG ATG 
glu glu leu ala leu gin met 

GTA CAT AAC ATG GAC AAA CGG 
val his asn met asp lys arg 

GAT GAC ATA ACT ACT ATT GTT 
asp asp ile thr thr ile val 

CCA CAG AAG GAA AAT GAA CAG 
pro gin lys glu asn glu gin 

GAC CTT CTC TCC CTC ATG GAT 
asp leu leu ser leu met asp 

CAG CTG TCA GCC AAG CTC AGT 
gin leu ser ala lys leu ser 

AGA ATC CTC TGT AGC CAT GAG 
arg ile leu cys ser his glu 

ACT GCT CCA ACA TCT CCT TGT 
thr ala pro thr ser pro cys 

TTC CAG GAC CAG AAG ATC GCC 
phe gin asp gin lys ile ala 

TTC CTC ACC GGG CTC CTC TTC 
phe leu thr gly leu leu phe 

ATC AGC AAA GTA CAA AGG AAA 
ile ser lys val gin arg lys 
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3301/1101 

GCT GTC AGT GCA ATT CAC AGC 
ala val ser ala ile his ser 
3361/1121 

CCA GAG GTG AAG GTC AAA ATC 
pro glu val lys val lys ile 
3421/1141 

GCT TTG CCA CAG CTC TGT GAC 
ala leu pro gin leu cys asp 
3481/1161 

TCG GAT GAA GAA CAA GAA GGA 
ser asp glu glu gin glu gly 
3541/1181 

GGG AAT AAT TTC AAT TTG AAA 
gly asn asn phe asn leu lys 
3601/1201 

TAC AAC ATG CTG AAC GCG GAC 
tyr asn met leu asn ala asp 
3661/1221 

AAA AAT GCT GAT CAG AGC CTC 
lys asn ala asp gin ser leu 
3721/1241 

AAC AGG ATT TTA GAT CTA CTT 
asn arg ile leu asp leu leu 
3781/1261 

AGT TCT GAC AAA GTC AGT ACC 
ser ser asp lys val ser thr 
3841/1281 

GAA GAG GCT TTG CTG CGT GGG 
glu glu ala leu leu arg gly 
3901/1301 

GGG AAC GAC CGA TTT CCA GGC 

gly asn asp arg phe pro gly 
3961/1321 

TGG CGG CAA GCT AAT GAG AAG 
trp arg gin ala asn glu lys 
4021/1341 

ATC AGT GGC AAT CTG GCT ACA 
ile ser gly asn leu ala thr 
4081/1361 

ATC CAG GCG AGC TCG GCT CTG 
ile gin ala ser ser ala leu 
4141/1381 

CTG GTG AAT TCT CTG AAC TGT 
leu val asn ser leu asn cys 

4201/1401 

CTC CGT GCT CTC ATC GCC AAG 
leu arg ala leu ile ala lys 
4261/1421 

TTC GAC CTA TGT CAC CAA GTC 
phe asp leu cys his gin val 
4321/1441 

CAA GCC TGT GCC ACC CTT TAC 
gin ala cys ala thr leu tyr 
4381/1461 

GCA AGA GTA AAG ATG CAA GTA 

ala arg val lys met gin val 



3331/1111 
CTG CTA AGT TCT CAC GAC 
leu leu ser ser his asp 

3391/1131 
GCC GCC CTT TAC CTA CCT 
ala ala leu tyr leu pro 

3451/1151 
TTT ACA GTT GCA GAT ACT 
phe thr val ala asp thr 

3511/1171 
GCC GGT GCC ATT AAC CAG 
ala gly ala ile asn gin 

3571/1191 
ACA AGT GGA ATA GTG CTG 
thr ser gly ile val leu 

3631/1211 
ACT ACT CGC AAC CTC ATG 
thr thr arg asn leu met 

3691/1231 
ATT AGG AAG TGG ATT GCT 
ile arg lys trp ile ala 

3751/1251 
TTC ATC TGT GTG TTA TGT 
phe ile cys val leu cys 

3811/1271 
CAA GTC CTG CAG AAG TCA 
gin val leu gin lys ser 

3871/1291 
GAA GGG GCC AGA GGG GAG 
glu gly ala arg gly glu 

3931/1311 

CTA AAT GAA AAT TTG AGA 

leu asn glu asn leu arg 

3991/1331 
CTA GAT AAA ACA AAG GCC 
leu asp lys thr lys ala 

4051/1351 
GAA GCA CAT TTA ATC ATC 
glu ala his leu ile ile 

4111/1371 
GAC TGT AAA GAC AGC CTG 
asp cys lys asp ser leu 

4171/1391 
GAT CAG AGT ACC ACC TAC 
asp gin ser thr thr tyr 

4231/1411 
TTT GGA GAC TTA CTC TTC 
phe gly asp leu leu phe 

4291/1431 
CTG CAC CAC TGC AGC AGC 
leu his his cys ser ser 

4351/1451 
CTC CTC ATG AGG TTC AGT 
leu leu met arg phe ser 

4411/1471 
ACC ATG TCC CTG GCA TCT 
thr met ser leu ala ser 



CTG GAC CCA CGC TGT GTC AAA 
leu asp pro arg cys val lys 

TTA GTT GGC ATC ATT TTG GAT 
leu val gly ile ile leu asp 

CGC AGA TAC CGC ACC AGT GGC 
arg arg tyr arg thr ser gly 

AAT GTG GCT CTG GCC ATA GCA 
asn val ala leu ala ile ala 

TCT TCC TTG CCC TAT AAG CAG 
ser ser leu pro tyr lys gin 

ATC TGC TTC CTC TGG ATC ATG 
ile cys phe leu trp ile met 

GAC CTG CCA TCA ACG CAG CTC 
asp leu pro ser thr gin leu 

TTT GAG TAT AAG GGA AAA CAG 
phe glu tyr lys gly lys gin 

AGG GAT GTC AAG GCC CGG CTG 
arg asp val lys ala arg leu 

ATG ATG CGC CGC CGG GCT CCA 
met met arg arg arg ala pro 

TGG AAG AAA GAG CAG ACA CAT 

trp lys lys glu gin thr his 

GAG TTA GAT CAA GAA GCC TTG 
glu leu asp gin glu ala leu 

CTG GAT ATG CAG GAA AAC ATT 
leu asp met gin glu asn ile 

CTG GGA GGT GTT CTG AGG GTG 
leu gly gly val leu arg val 

CTG ACT CAC TGC TTT GCA ACA 
leu thr his cys phe ala thr 

GAA GAG GAG GTG GAA CAG TGT 
glu glu glu val glu gin cys 

AGC ATG GAT GTC ACC CGG AGC 
ser met asp val thr arg ser 

TTT GGA GCC ACC AGT AAT TTT 
phe gly ala thr ser asn phe 

TTG GTG GGA AGA GCA CCA GAC 
leu val gly arg ala pro asp 
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4441/1481 

TTT AAT GAA GAG CAC 
phe asn glu glu his 
4501/1501 

ACA GCC ATG GAG ATG 
thr ala met gin met 
4561/1521 

AGC ATC TTA TAT GAG 
ser ile leu tyr asp 
4621/1541 

GAT CTC ATG TAG AGA 
asp leu met tyr arg 
4681/1561 

CTC GAG AAC ATG GCA 
leu gin asn met ala 
4741/1581 

CTG GTG CAC GCC GCT 
leu vai his ala ala 
4801/1601 

CTG GCC GTG GGC AGT 
leu pro val gly ser 
4861/1621 

GTG TCT GAG GAG ACC 
val ser glu asp thr 
4921/1641 

GAG AGT GGC CTG GTA 
glu ser gly leu val 
4981/1661 

TAT GAG ACA GTT AAT 
tyr glu thr val asn 
5041/1681 

TTC GGG AAG CTG ACA 
phe arg lys leu thr 
5101/1701 

AAG GAT CAT AAG AGA 
lys asp his lys arg 
5161/1721 

GGG GAT TTG GAT GAA 
gly asp leu asp glu 
5221/1741 

ATC TCA CAT AGA CTA 
ile ser his arg leu 
5281/1761 

ATT AAA GAC TCC ACT 
ile lys asp ser thr 
5341/1781 

ATC ACT TTT GTG GAG 
ile thr phe val glu 
5401/1801 

GAG AAG AAT TTC AAC 
glu lys asn phe asn 
5461/1821 

GCT CGG GGA GAG CTG 
pro arg gly glu leu 
5521/1841 

TTC CCC TAG ATC AAG 
phe pro tyr ile lys 



CTG AGA AGA TCC TTG 
leu arg arg ser leu 

ACT CCT TTT CCC ACC 
thr pro phe pro thr 

ACA GTG AAA ATG AGG 
thr val lys met arg 

ATT GCC AAG AGT TAG 
ile ala lys ser tyr 

GAG AAA CAC ACC AAG 
glu lys his thr lys 

GCG TTA GTG GCT GAG 
ala leu val ala glu 

GTG AGC TTC GAG AAT 
val ser phe gin asn 

CTG TCA CCT GAC GAG 
leu ser pro asp glu 

GGC CTC CTG GAG GAG 
gly leu leu glu gin 

GAG GTG TAG AAG CTG 
glu val tyr lys leu 

CTC ACT CAC AGC AAG 
leu thr his ser lys 

ATG TTT GGA ACC TAC 
met phe gly thr tyr 

CAG GAG TTT GTC TAC 
gin glu phe val tyr 

GAG GCA TTT TAT GGT 
glu ala phe tyr gly 

CCT GTG GAC AAA ACC 
pro val asp lys thr 

CCC TAC TTT GAT GAG 
pro tyr phe asp glu 

CTC CGG AGG TTC ATG 
leu arg arg phe met 

CAT GAG CAG TAC AGA 
his glu gin tyr arg 

ACC AGG ATC AGC GTC 
thr arg ile ser val 



4471/1491 

AGG ACA ATT TTG GCC 
arg thr ile leu ala 
4531/1511 

CAG GTG GAG GAA CTT 
gin val glu glu leu 
4591/1531 

GAA TTT CAG GAA GAT 
glu phe gin glu asp 
4651/1551 

CAG GCA TCT GGT GAT 
gin ala ser pro asp 
4711/1571 

AAG AAG TGC TAC AGG 
lys lys cys tyr thr 
4771/1591 

TAT CTG AGC ATG CTG 
tyr leu ser met leu 
4831/1611 

ATT TCT TCC AAT GTG 
ile ser ser asn val 
4891/1631 

GAT GGG GTG TGC GCA 
asp gly val cys ala 
4951/1651 

GCC GCG GAG CTC TTC 
ala ala glu leu phe 
5011/1671 

GTC ATC CCC ATC CTA 
val ile pro ile leu 
5071/1691 

CTG CAG AGA GCC TTC 
leu gin arg ala phe 
5131/1711 

TTC GGA GTT GGT TTC 
phe arg val gly phe 
5191/1731 

AAA GAG CCT GCA ATT 
lys glu pro ala ile 
5251/1751 

GAA TGT TTT GGT GGA 
gin cys phe gly ala 
5311/1771 

AAG TTG GAT CCT AAC 
lys leu asp pro asn 
5371/1791 

TAT GAG ATG AAA GAC 
tyr glu met lys asp 
5431/1811 

TAC ACC ACC CGG TTG 
tyr thr thr pro phe 
5491/1831 

AGG AAC ACA GTC CTG 
arg asn thr val leu 
5551/1851 

ATC CAG AAG GAG GAG 
ile gin lys glu glu 



TAT TCA GAA GAG GAC 
tyr ser glu glu asp 

CTC TGT AAT CTG AAT 
leu cys asn leu asn 

CCT GAG ATG CTT ATG 
pro glu met leu met 

CTG CGG CTG ACC TGG 
leu arg leu thr trp 

GAG GCT GCC ATG TGC 
glu ala ala met cys 

GAG GAC CAC AGC TAC 
glu asp his ser tyr 

CTG GAG GAG TCT GTG 
leu glu glu ser val 

GGC CAG TAC TTC ACC 
gly gin tyr phe thr 

AGC AGG GGA GGC TTA 
ser thr gly gly leu 

GAA GCG CAT GGA GAA 
glu ala his arg glu 

GAC AGC ATC GTT AAC 
asp ser ile val asn 

TTT GGA TCC AAA TTT 
phe gly ser lys phe 

ACC AAG CTT CCT GAG 
thr lys leu pro glu 

GAA TTT GTG GAA GTG 
glu phe val glu val 

AAG GCC TAC ATA CAG 
lys ala tyr ile gin 

AGG GTC ACA TAC TTT 
arg val thr tyr phe 

ACC CTG GAG GGG CGG 
thr leu glu gly arg 

ACC ACT ATG CAC GCC 
thr thr met his ala 

TTT GTT TTG ACA CGG 
phe val leu thr pro 
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5581/1861 
















5611/1871 
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GCC 
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GAA 
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AAG 
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AAG ACC CTG 
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GTT 
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ATT 
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ile glu val 
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glu 
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lys thr leu 


gin 


leu 
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val 
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asn 


5641/1881 
















5671/1891 
















CAG GAG CCG 


CCT 


GAT 


GCA 


AAG 


ATG 


CTT 


CAG 


ATG GTG CTG 


CAA 


GGC 


TCT 


GTG 


GGA 


GCT 


ACT 


gin glu pro 


pro 


asp 


ala 


lys 


met 


leu 


gin 


met val leu 


gin 


gly 


ser 


val 


gly 


ala 


thr 


5701/1901 
















5731/1911 
















GTA AAT CAG 


GGA 


CCA 


CTG 


GAA 


GTA 


GCC 


CAA 


GTG TTT TTG 


GCT 


GAA 


ATT 


CCT 


GCT 


GAT 


CCA 


val asn gin 


gly 


pro 


leu 


glu 


val 


ala 


gin 


val phe leu 


ala 


glu 


ile 


pro 


ala 


asp 


pro 


5761/1921 
















5791/1931 
















AAA CTC TAT 


CGA 


CAT 


CAC 


AAC 


AAG 


TTG 


AGG 


TTA TGC TTT 


AAG 


GAA 


TTC 


ATC 


ATG 


AGA 


TGT 


lys leu tyr 


arg 


his 


his 


asn 


lys 


leu 


arg 


leu cys phe 


lys 


glu 


phe 


ile 


met 


arg 


cys 


5821/1941 
















5851/1951 
















GGT GAA GCT 


GTA 


GAG 


AAA 


AAC 


AAG 


CGT 


CTC 


ATC ACG GCA 


GAC 


CAG 


AGG 


GAA 


TAT 


CAG 


CAG 


gly glu ala 


val 


glu 


lys 


asn 


lys 


arg 


leu 


ile thr ala 


asp 


gin 


arg 


glu 


tyr 


gin 


gin 


5881/1961 
















5911/1971 
















GAA CTC AAA 


AAG 


AAC 


TAT 


AAC 


AAG 


CTA 


AAA 


GAG AAC CTC 


AGG 


CCA 


ATG 


ATC 


GAG 


CGG 


AAA 


glu leu lys 


lys 


asn 


tyr 


asn 


lys 


leu 


lys 


glu asn leu 


arg 


pro 


met 


ile 


glu 


arg 


lys 


5941/1981 
















5971/1991 
















ATT CCA GAA 


CTG 


TAC 


AAG 


CCA 


ATA 


TTC 


AGA 


GTT GAG AGT 


CAA 


AAG 


AGG 


GAC 


TCC 


TTC 


CAC 


ile pro glu 


leu 


tyr 


lys 


pro 


ile 


phe 


arg 


val glu ser 


gin 


lys 


arg 
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ser 


phe 
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6001/2001 
















6031/2011 
















AGA TCT AGT 


TTC 


AGG 


AAA 


TGT 


GAA 


ACC 


CAG 


TTG TCA CAG 


GGC 


AGC 


TAA 










arg ser ser 


phe 


arg 
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cys 


glu 


thr 


gin 


leu ser gin 


gly 


ser 


OCH 











GAAAAGCCATCTTCATTCGTGGAGACTGTGGCCCTGCAACCCTGGAGAAGGACTTGCTGGTACTTAAAAAATGGGACATT 
TGCCACCCAGGACTGACTGTACACTCCCTGATCAGCCAGCACTCTGGAAGCTTTGGGATCCCAGGAACCATGGAATTATT 
CCCAAATGGACTCTGACCAGATTTTTGCCATACTGGGGGGTGGCGGGATGGAGGATGGGTACTCAGGCATGACTGCGTAT 
TTATTAAAGTGTGTTTTTCCACAATGTACCAAACAAGGCATAAGCAGCTTCTCCTGCTGACTGGCCAATCACTGCCCATC 
TGAGAGATGATTTCCTCTGGCCCATATTTGAATTTATTGGAGTAACTCAAATTGCCTGAGGAAAAATGGAAAAATTATCC 
ACCAGTCGATTCAAACTGAATTTCACTCTTTATAGGAAGGCAGGGCAAACTTGTAGGAGTACGAAACATTTTCAATAAAT 
CTACAAAGGGAAGCCTTACTACAATTCCAAAAATCATCATGGTTGGAAATTTGGGAGGAGATTATTTGTGAACTTGTTAC 
CCTTTTGGTAATGGTGGACTAATTGCTGTATAGTTATTTTTGTTTTATTATTACTGTTACATTAATTTAACATGCATTTA 
TAGAAGAATACATTCAAAGCACTGATGTAGGAGATACACGGTACTTGGAGCAGTCAGCCAAAAATCACAGATACTGCTTT 
CACTTAAATGGAAACAATTCTCCGATAATGCTTTGCTTTTTTTCTTATGTCACTCTTGTGTACTATCTATTTTTCTCCTC 
TCTGGGACCAAGTTTCTTTTTATAAAGCAATAATATCTCTGTTTTCATTTCAGAACATTGTGCTGTCTGTCAGCATATGT 
ATATCAGCTACAAAATATATTCAACTTTGACTTCTTTTGACAAAGGACTTTAGGAAAAGGAGGAACAAAGACATTATTTG 
AGAATTAAATTATATATTTTTAATATGACTGTGACCTTGACTGATAATAAAGATGTAATAAGAATTGCAAGCTAAAAAAA 
AAAAAAAAAAAACTCG 
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A. Allelic variations: single nucleotide changes (polymorphism) between CLASP-5 



isof orms 



Isof orm 


Nucleotide (s ) 


Consequence 


1 


1727 


C to T change; mis-sense 
mutation changing codon from 
alanine to valine 


2 


1749 


A to G change; silent mutation 


3 


2277 


G to C change; silent mutation 


4 


2853 


C to T change; silent mutation 


5 


3427 


A to G change; mis-sense 
mutation changing codon from 
lysine to glutamic acid 


6 


3777 


C to T change; silent mutation 



Isof orm 


,j.vc; o j-^ -L j_ t= o 

Difference 


Nucleotide (s) 


Consequence 


1 


exon deletion 


1806-1944 


premature, in-frame stop codon 
leading to the production of a 
truncated, most likely soluble 
protein 


2 


exon insertion 


between 2857 
and 2858 


additional, in-frame 4 8 nucleotide 
exon that contains a stop codon at 
the second codon, which would 
lead to a truncated, most likely 
soluble protein 



These differences may be found separately or together in various combinations 
different human CLASP-5 isoforms 
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FIG. 6C 



1st partial exon (nucleotides 3793 to 3952) 

CCAGCTGTCAGCCAAGCTCAGTAACCTTCCAACGCTCATTTCCATGAGGCTAG 

AGTTCCTGAGAATCCTCTGTAGCCATGAGCATTACCTCAATCTGAACCTTTTT 

TTTATGAATGCTGATACTGCTCCAACATCTCCTTGTCCTTCCATATCTTCCCAG 

GTAATAAAAGAATTATTTAACTAAAAGAATTATTCAAGCTAT 

2nd exon (nucleotides 5809 to 5948) 

GCTCATAAAATGGCTCCTTACGTTTCTGTAG AACTCAAGCTCCTGCTCCAGCT 
TCCAGGACCAGAAGATCGCCAGCATGTTCGATCTGACTTCCGAGTACCGCCA 
GCAGCACTTCCTCACCGGGCTCCTCTTCACAGAACTGGCTGCTGCCCTGGATG 
CCGAAGGGGAAGGGTATGTTTCTGGCATTTAAAATGGAAGATGAAGC 



3rd exon (nucleotides 13652 to 13831) 

CATAACCTCTTGATTCCTGTGTTGTGCCAACAG AATCAGCAAAGTACAAAGG 
AAAGCTGTCAGTGCAATTCACAGCCTGCTAAGTTCTCACGACCTGGACCCAC 
GCTGTGTCAAACCAGAGGTGAAGGTCAAAATCGCCGCCCTTTACCTACCTTTA 
GTTGGCATCATTTTGGATGCTTTGCCACAGCTCTGTGACTTTACAG GTAATGG 
CCCTTCTGTTTTCTTTCTTGGATTG 

4th exon (nucleotides 16948 to 17087) 

TGTTTGACTTGACATCACAAACGATGTTTTCATTGCAG TTGCAGATACTCGCA 
GATACCGCACCAGTGGCTCGGATGAAGAACAAGAAGGAGCCGGTGCCATTA 
ACCAGAATGTGGCTCTGGCCATAGCAGGGAATAATTTCAATTTGAAAACAAG 
TGGAATAGTGCTGTCTTCCTTG GTATGTTGGTGCACATGTGTCTGGTTGATTTT 

TCAT 



5th exon (nucleotides 19281 to 19463) 

TGGCCTCCATCCCCCAATCTGCCTCCCTTCAG CCCTATAAGCAGTACAACATG 
CTGAACGCGGACACTACTCGCAACCTCATGATCTGCTTCCTCTGGATCATGAA 
AAATGCTGATCAGAGCCTCATTAGGAAGTGGATTGCTGACCTGCCATCAACG 
CAGCTCAACAGGATTTTAGATCTACTTTTCATCTGTGTGTTATGTTTTGAGTAT 
AAGGTAAGTCTGGAGTGGCACAACTTTATACCAGC 



6th exon (nucleotides 19829 to 19958) 

CACCAAAGGACATGTCCTCCTACCTCTGTCTTGTCCAG GGAAAACAGAGTTCT 
GACAAAGTCAGTACCCAAGTCCTGCAGAAGTCAAGGGATGTCAAGGCCCGG 
CTGGAAGAGGCTTTGCTGCGTGGGGAAGGGGCCAGAGGGGAGATGATGCGC 
CGCCGGGCTCCAGGTGTGTTGGACTGGCCCTTCCCTGCTCTCTGTCAAGC 



FIG. 7A(1 of 4) 



7th exon (nucleotides 20928 to 21015) 

TCAAATTCCTATCATGCATTTCTTAACTCCTAG GGAACGACCGATTTCCAGGC 
CTAAATGAAAATTTGAGATGGAAGAAAGAGCAGACACATTGGCGGCAAGCT 
AATGAGAAGCTAGATAAGTGAGTCACTCGGCAACTTTCTGCTACTTTTACCT 



8th exon (nucleotides 25755 to 25861) 

GCTTTAATTTGACCTCTTGTTGTTTCCTAG AACAAAGGCCGAGTTAGATCAAG 
AAGCCTTGATCAGTGGCAATCTGGCTACAGAAGCACATTTAATCATCCTGGA 
TATGCAGGAAAACATTATCCAG GTGAGGAAAACAAACACCCAATCTGATTTG 

9th exon (nucleotides 27242 to 27375) 

GGATTCAATGATGCTGTTCTTCCATTCCCCCAG GCGAGCTCGGCTCTGGACTG 
TAAAGACAGCCTGCTGGGAGGTGTTCTGAGGGTGCTGGTGAATTCTCTGAAC 
TGTGATCAGAGTACCACCTACCTGACTCACTGCTTTGCAACACTCCGTGCTCT 
CATCGCCAAGGTAAACTTGGGATGCTTGTTTTCTTCCTCTTAATT 



10th exon (nucleotides 28582 to 28734) 

AGTGATGCCTAATGGCCCTTTATGTCTCTCCTAG TTTGGAGACTTACTCTTCG 
AAGAGGAGGTGGAACAGTGTTTCGACCTATGTCACCAAGTCCTGCACCACTG 
CAGCAGCAGCATGGATGTCACCCGGAGCCAAGCCTGTGCCACCCTTTACCTC 
CTCATGAGGTTCAGTTTTGGAGCCACCAGT GTAAGAGTTCAAACCAGCTGAG 
TGACCTGGAATCAG 

11th exon (nucleotides 31046 to 31204) 

TTACTTCATCTTTTTTTTTTTTTTCACTGATGCAG AATTTTGCAAGAGTAAAGA 
TGCAAGTAACCATGTCCCTGGCATCTTTGGTGGGAAGAGCACCAGACTTTAA 
TGAAGAGCACCTGAGAAGATCCTTGAGGACAATTTTGGCCTATTCAGAAGAG 
GACACAGCCATGCAGATGACTCCTTTTCCCACCCAG GTACACCGAAGCACAT 
ACCTTGTCTCATGCATGAGT 

12th exon (nucleotides 32755 to 32855) 

Anr.TAAr^ATTATTTTGAGGCTTACACTTTTTGCAG GTGGAGGAACTTCTCTGT 
AATCTGAATAGCATCTTATATGACACAGTGAAAATGAGGGAATTTCAGGAAG 
ATCCTGAGATGCTTATGGATCTCATGTACAG GTAAGCTTTCCTGACACACTCA 

AGGGACACCATTT 

13th exon (nucleotides 33653 to 33855) 

TCCTCAAAACTACTTCTCACTCAATCTGTCTTCAG AATTGCCAAGAGTTACCA 

ggcatctcctgatctgcggctgacctggctccagaacatggcagagaaacacac 
caagaagaagtgctacacggaggctgccatgtgcctggtgcacgccgctg 
cgttagtggctgagtatctgagcatgctggaggaccacagctacctgccc"gt 
gggcagtgtcagcttccag gtagggtgtgtgcagcttttcccttagagcagtg 

GTTC 
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14th exon (nucleotides 38125 to 38268) 

rTnTTr.TnCAnGCTTATACTGTGGTCTCTTTCAG AATATTTCTTCCAATGTGCT 
GGAGGAGTCTGTGGTCTCTGAGGACACCCTGTCACCTGACGAGGATGGGGTGTG 
CGCAGGCCAGTACTTCACCGAGAGTGGCCTGGTAGGCCTCCTGGAGCAGG 
CCGCGGAGCTCTTCAGCACG GTCAGTGCCCAGAGGGCATCCCGGGGCCTGGC 

C 

15th exon (nucleotides 40166 to 40297) 

BBT-Tr^PTrTnaTGr/rr.TTrTrCTCTTTCCAAG GGAGGCTT ATATGAGACAGT 
TAATGAGGTCTACAAGCTGGTCATCCCCATCCTAGAAGCGC ATCGAGAATTC 
CGGAAGCTGACACTCACTCACAGCAAGCTGCAGAGAGCCTTCGACAGCATCG 
TTAACAAGGTAGCCGGGGAGCCTGGCTGGCAGGTCTTGTTAC 



16th exon (nucleotides 40755 to 40889) 

rraar-r^ar.acrTTTTTAT&TTTTGTTCCTCAG GATCATAAGAG AATGTTTGGAA 
CCTACTTCCGAGTTGGTTTCTTTGGATCCAAATTTGGGGATTTGGATGAACAG 
GAGTTTGTCTACAAAGAGCCTGCAATTACCAAGCTTCCTGAGAT CTCACATAG 
ACTAGAGGTAAGAAAAGTGATTCTGTGCGCCTGACCTGGTACACTTTAC 



17th exon (nucleotides 42307 to 42396) 

aarrTTTaTaftAr.TGTTGGTTCTTCTTACCTAG GCATTTTATGGTCAATGTTTT 
GGTGCAGAATTTGTGGAAGTGATTAAAGACTCCACTCCTGTGGACAAAACCA 
AGTTGGATCCTAACAAGGTATACAAAAATTTACAAAAACTAACCATCAAGC 



18th exon (nucleotides 45250 to 45486) 

TrTTr.Tr.CCTCCGTGCCTTTTCCCCCTTAG GCCTACATACAGATCACTTTTGTG 
GAGCCCTACTTTGATGAGTATGAGATGAAAGACAGGGTCACATACTTTGAGAAGA 
ATTTCAACCTCCGGAGGTTCATGTACACCACCCCGTTCACCCTGGAGGG 
GCGGCCTCGGGGAGAGCTGCATGAGCAGTACAGAAGGAACACAGTCCTGAC 
CACTATGCACGCCTTCCCCTACATCAAGACCAGGATCAGCGTCATCCAGAAG 
GAGGAGGTAATGCACCCAAGGGATTGGCCACCACTGGATGAGT 



19th exon (nucleotides 48664 to 48807) 

7vn7Nr:irrrzar-TTPrr'TaTr;TTTarGTCTr.ATGTTCAG TTTGTTTTGACACCGATTG 
AAGTTGCCATTGAAGACATGAAGAAGAAGACCCTGCAGTTAGCAGTTGCCAT 
TAACCAGGAGCCGCCTGATGCAAAGATGCTTCAGATGGTGCTGCAAGGCTCTGT 
GGGAGCTACTGTAAATCAGGTAAGCAAAACCAGAGGTGGCAGCTCCT 



20th exon (nucleotides50892 to 50998) 

Taq^aTTrTTTTTTTTTTTTTTTTTTTTTTTCCCACCAG GGACCACTGGAAGTAGC 
CCAAGTGTTTTTGGCTGAAATTCCTGCTGATCCAAAACTCTATCGACATCACAACA 
AGTTGAGGTTATGCTTTAAGGAATTCATCATGAG GTAAGAAGGAAAATG 
GCTGGGAATTTCAGTAGAG 
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21st exon (nucleotides 62398 to 52568) 

TCATTTATTTCTCCCACACTGATATTTTCATCTCAG ATGTGGTGAAGCTGTAG 
AGAAAAACAAGCGTCTCATCACGGCAGACCAGAGGGAATATCAGCAGGAAC 
TCAAAAAGAACTATAACAAGCTAAAAGAGAACCTCAGGCCAATGATCGAGC 
GGAAAATTCCAGAACTGTACAAGCCAATATTCAGAGTTGAGAGTCAAAAGAG 
GTAAGAACAGGGCAGAGGAGGCCTCTTCCTGTGGGAT 

22nd exon (nucleotides 63040 to 63294) 

CCTCCCTCTCTTTTCTTAATTTCAG GGACTCCTTCCACAGATCTAGTTTCAGGA 
AATGTGAAACCCAGTTGTCACAGGGCAGCTAAGAAAAGCCATCTTCATTCGT 
GGAGACTGTGGCCCTGCAACCCTGGAGAAGGACTTGCTGGTACTTAAAAAAT 
GGGACATTTGCCACCCAGGACTGACTGTACACTCCCTGATCAGCCAGCACTC 
TGGAAGCTTTGGGATCCCAGGAACCATGGAATTATTCCCAAATGGACTCTGA 
CCAGATTTTTGCCATACTGGGGGGTGGCGGGATGGAGGATGGGTACTCAGGC 
ATGACTGCGTATTTATTAAAGTGTGTTTTTCCACAATGTACCAAACAAGGCAT 
AAGCAGCTTCTCCTGCTGACTGGCCAATCACTGCCCATCTGAGAGATGATTTC 
CTCTGGCCCATATTTGAATTTATTGGAGTAACTCAAATTGCCTGAGGAAAAAT 
GGAAAAATTATCCACCAGTCGATTCAAACTGAATTTCACTCTTTATAGGAAG 
GCAGGGCAAACTTGTAGGAGTACGAAACATTTTCAATAAATCTACAAAGGGA 
AGCCTTACTACAATTCCAAAAATCATCATGGTTGGAAATTTGGGAGGAGATT 
ATTTGTGAACTTGTTACCCTTTTGGTAATGGTGGACTAATTGCTGTATAGTTAT 
TTTTGTTTTATTATTACTGTTACATTAATTTAACATGCATTTATAGAAGAATAC 
ATTCAAAGCACTGATGTAGGAGATACACGGTACTTGGAGCAGTCAGCCAAAA 
ATCACAGATACTGCTTTCACTTAAATGGAAACAATTCTCCGATAATGCTTTG C 
TTTTTTTCTTATGTCACTCTTGTGTACTATCTATTTTTCTCCTCTCTGGGACCAA 
GTTTCTTTTTATAAAGCAATAATATCTCTGTTTTCATTTCAGAACATTGTGCTG 
TCTGTCAGCATATGTATATCAGCTACAAAATATATTCAACTTTGACTTCTTTTG 
ACAAAGGACTTTAGGAAAAGGAGGAACAAAGACATTATTTGAGAATTAAATT 
ATATATTTTTAATATGACTGTGACCTTGACTGATAATAAAGATGTAATAAGAA 
TTGCAAGCTAAAAAAAAAAAAAAAAAAA 
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GTTCTCTGTGGTTAGTCACTTAGTGACTTTAGATAAGTTTTTCCAATTTTATGGGTCTTAATTTCCTCAGTTTTAAAATA 

AGAAGGGGGGGTTGAGAGATTTGAGGGCTGATCAACGAAAAGGATAGGACCATAAAAAGCAGTGACATACAAGCTTCATT 

GAGCAGCACTTGGACAGGGTTACATAAGAGCGGAAGCCCCTCCCAGCATGAGAACAGCCATAGGCCTGCAGTGAGGAGGG 

GACCATCCAGAGGAGCAGGGGAACTCCCAGGGGAGAGGAGGATTAGGGCAGAAGCTTATAGATCTGGGTGAGGCTGCTCC 

ACAGCACAGTAGGGAGTCTCTGGGTCAGAGAGCTCCAAGGGCTGTAGCAGCTTAGGGCCCTGTATCTGCAAGGCTCTATC 

TTATCATTAGGAAACAGCTGTCCATCGAGATTTTATGGGGTCTGCAAAGGAGGCAGGCACTAAATGGATGAAAATCTGCT 

TATATGAGCTATTTTTGTTAATGACTGGGCATGGTAAAATTTGAGTTAGGTATGAGCCAAGGGATCAGCACCAGTCGACA 

GTGAAGAAATAAACAACCGAGGGGCCAATACACAGAAACAATCTTTGACTTGTTTACATAACACCTCTCCTCCAAAAAAA 

GAAAAAGACATGACTTTTCTAAAGGAAGTCCTGTCTACAGCCATACCACCCTGAACGCACGCAGTCTCATCTAAGGGAAG 

TCCTGATAAGGAAGCAGAATAATAAAAAAGAATCCTAAGGAAACATCAGAAGTCCCCAAGCATCCCCATCCCATGCACCC 

TGACCCCTGCCCTGCAGCGGATCTTCTGTCCCAGGACCCACCAGAATAGAATGGCAGAGGGACACTTCACTCCTCCTTGC 

CCTCCCTCTTCAGTATTTAGGATTCCAGGTTAGTCTGTTTTCAAAACTTTAAGTTGAGTGTATGAAAGATACCCTAGATC 

ACCACTGCAGACCCAGGCTTAGCTACTCACATGCAGGCTATCTTCATTCCCGACAGGGAAATAAGGCAACCCAAGGTAGA 

TATCTGCCTTCCCTGCAAAACTCATGTTTTTATTTTCTCTTTCTCTTTCCTTTCTTTTCTCTTGCTCACTATCTTTCTCC 

TTAAATTCTTCCCTTTTCTTTTAGTTACCATTTCTTTTTTTGTTTCTCTCCTTGACTCAAATGCCAAAGATCCCTAGACC 

AGGAGTCAAGAAGCCTGCATGCTGATTCCTGGGACACCATTAATTACTTCTGTGATTGGGAGCATCAGCCTTTGAGGCAT 

TCATTTCCTCATCTGCAAAAAAACTAGGCTGGATTAGATTTATCCACTGATTCTGTGGTCTGTGTCTGCCAGTGACATCC 

ACGGATGTTACTTAGCACCATTAGTGGCACTCAGGCCTCAGAAGGTCACTGACCCCCATTCGTGGTGATTTAATTCATTG 

ATCCCAGCTCTCTAGATACAGGATGAACTTCATACCTTAAGCAAGTTGTATTCTTACAAAGTCGTCTGACTTTATCATTT 

TGCATAACCTATTATGTTTTCTGCCATCTAAAGATTTTGTGGCAATGGATATAACCTTAGTCTTTTTTAATAGTCACTAA 

AAATAGACAAATCCATTTCCTCAATTACTGTCTGTCACTTACAATGTGCCACTAAGCAAACACCCATCCCTTGGGTAGGG 

GCATTGTGGGTCTTGACCTTTGGGAAGGAAGTTTTTGGAATGCACCTTACTCCTCTTTTCAAAACATCACTCAAATTGTT 

TTTGAAATGTCTATTTTTTTTCTTTTGAGATGGAGTCTTACTCTGTTGCCCAGGCCAGAGTGCAGTGGCACAATTTTGGC 

TCACTGCAAACTCCGCCTCACAGGTTCAAGCGATTCTCCTTCCTCACCCTCCCGAGTAGCTGGGATTACAGGTGTGTGGC 

ACCACTCCCCGCTAATTTTTGCATTTTTAGTAGAGACAGGGTTTCACCATGTTGGCCAGACTGGTCTTGAACTCTGGCCT 

CAAGTGATCTGCCTGCCTCTGCCTCCCGAAGTGCTGGGATTACAGGTGTGAGCCACCACACCCAGCCTGAAATGTCTGTC 

TTGAATGATGTCTCAAGGGACCTATGAAAGATACCCATAGTGGGGCCTTCTTTTAAGTGCCAATGTGTTGTGGGTTCAAG 

TTCCGATAGCCGGCTTGACCCGACACCTGTTAATGAGTAACCTAAGTGACAGGCACATGACCAAGTTCTAATCCCTTCAA 

TGTGCTGGTGGCTCCACTGGTCCAAAGTCAGCCAGGAGTGCACATCGAAAGGTTATGGGATCTGGTAACTGTGCTTACAT 

AGAAGTCATATGTTTTGGTTTTAAAATAATATATAATGGCATTTACTTATTTTAAGTGGATGTCTAACTATGAATTAATT 

CTGTAGGCAATATGTCCCACAACACATTGGCTTCTTGTAAAATGGCTGAAAATATGTGTTCATTTAAATTATATTGTTTA 

GTCTGTAATCCCAGCACTTTGGGAGGCCAAGGCGGGAGGATCACCTGAGGCCAGGAGTTCGAGACCAGCTTGGCCAATAT 

GGTTAAACCCCATCTCTACTAAAAATACAAAAAATTAGCTGGGCATGTTGATGTGCACCTGTAATCCCAGCTACTCGGGA 

GGCTGAGGCAAGAGAATCGCTTGAACCTGGGAGATGAAGGTTGCAGTGAGCCGAGATCATGCCACTGCACTCCAGTTCAA 

CAGAGCAAGACTCTGTATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTATATATATATATATATATATATACATAT 

ATATATATGTGTATATATATATGTGTATATATATATATGTATATATATATATATGTGTATATATATATATGTGTATATAT 

ATATATATATAGTTTAAAAAACAAACTACTCTTGAAGTGAGTGAAGAGGAGGCATGTCTATCACAAGGATGATGCTTCAT 

ATTTCTGTGCTGGGGTGGGGGGTGATAATGATGAAATATTGAGGAGCTCAAGGTGCATCAGCGCCCCTTCCTTCCCCTAT 

TTTGCCTTTAGAGGAGCTGCCTCTGAGTCTGGGTCTTTCAGTTGTTCAGCTTGCCTGGGGGAAGCAAGGGGTCTCTTCCA 

GACAGTCACCTTTTTCTGCTTTTCATTGCTTGCTTCATGGTTTATTTTTTAAAGGAAGATTTTTCCTAAAAACTCTTCTA 

GCTTTCTTTCTCTTTCTTCTCCATTTCCTCCTCTCTCAGTCTGGGACAAGCTCCTTGAGTTTTTTGCAGGTACTATGTAC 

TTTGCTAAACAGTTTGATGCCTTTCTCTAGGGAACTGGTTCTACAACTTTCCAATGGGGCCTTTAATTAGAAACTACGAG 

AGAACACTTGTAGTATAAAAGTCATCTAGTCATATTCTATTAGTTTCATACAGGCTCATATGAGGTCAACTCCTTTCATT 

TAGTTTCTCGAACATAGTAGAGTTTTTGTAAAATTAATTATGTTACGGTGAAGATGTACCTCAAGATTTTCAGCACAGGC 

TTCCCATGGTATTAAAGATTTGATAAAGTGGAAATCGGGATAAGAATCCTCATTCTGGACAGCTACTAGGCTAGAATCAC 

TAAGGGGAACAGTAATGAATGAAAATTTATTAGACCTCTCTGTAATGCAGAATTTACCATCTGTTGCACTATCCCATTAG 

TTTTTATTAAATTGATTGCCTTAACCTGGAGAGAAAGGATATTTTTGTGTCTGCCAACCTCAACTCCACTTACCTTGTAA 

TAAATGTTTCATTCCTCTTTTCATTTTTCTTCCCAGCTGTCAGCCAAGCTCAGTAACCTTCCAACGCTCATTTCCATGAG 

GCTAGAGTTCCTGAGAATCCTCTGTAGCCATGAGCATTACCTCAATCTGAACCTTTTTTTTATGAATGCTGATACTGCTC 

CAACATCTCCTTGTCCTTCCATATCTTCCCAGGTAATAAAAGAATTATTTAACTAAAAGAATTATTCAAGCTATTTCATT 

TAACTAGCTCAGTTTAATCATGTATTTCCTATAAAGGTTAGTCTTATTAATTTGACAAAAAATCAAACAATTCAAACCAG 

ATCAAGTATGCTACCCTGAAGTTACACCACTAGCTAAGAATTAACAATCTAAGTAATTGGTTTCTCCCCAGGCTCAAGGC 

TCCCTGATCAGGTTAAGTAAAGCCAAGAATCCAATAAGCCCTATGAAATTTAGAAACTCATAGAAAAGTCTCAAATCTTC 

CTTTGTCTGACATTAGCCAATTGTTATATTATGCAAATAGAGGATTCCAAAGTAAATAAGTTTTGGAAACCATGTATTCA 

CCAGGTTTCTTTGCAGCAGGCCCTCTAAGAGGTTTTAACTTGTTCATGTCAATTGCAAATGCACGAGAAGCAGAAGAGGA 

AATGGGAGGAGGTGTGCATGATAAGCAGTGATTCCCCAAATGATTTAACATGGAAACCCCACCCCACCTCATACATTTTT 

TGAAATGTTACTCTGGAATAACTTTAAGAACATGCTGTTTGATAGTCACATTCCATTTATAGGGAACACATCAGTAGCTC 
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AGAGCATCAGGCAAAATTTCTCATTATTAGGTTATATTTCTGTTGCATATTCCTTGATACTAGTACAAAAGTGAAGGCTT 
GTCTTACTAATTGAAAAAAAAATCTTAGCCATATATGCCATATGGCATGATCCAGATATTAGCTACATGACCATCTTACT 
GTGAACAGGGAAAGATCTGACTCACAAGCAGCAATTCAAAATGTATAAACTTTTTGCTCCATCCTGCCCAGCAGCTTAAG 
TCCTAAAACTTCCCTGGACTAAGCTTACCCTAGGTTCTTTCCCACCTTCCCCTTCTGACTGCCCCAGGGAGTGGCCAAAC 
CCAGGGGCCAGGCTCACAAAACCATGAAGGATTTCTAAAGACACTTTAGATGCTCTTAATGAAATATAAAGTGTGCTCCC 
AGGATACAAATACAGGACAGGAATTACTGAGGACCGGTAAATCTAATACTTCCCTCCCTGACATCACTTGTAGTTCCAGG 
CCAGCAAAAGTCTGACAATGTGCTTAAGCCAAATTCAGAAGTGTAGCTGAGGCCGGGCACGGTGGCTCACATTTGTAATC 
CCAGCATCTTCGGAGGCCAAAGGGAGTGGAATACTTGAGGCAGGAGTTACCAGCCTGACCAACATGATGAAAACTCATAT 
CTACTAAAAATACAAAAATGCATCAGGTGTGGTAGTGTGACTGTAATCCCAGCTACTTGGGACGCTGAGGCATGAGAATT 
GCTTGAACCCGGGAGAGGGAGGTTGCAGTGAGCTGAGACCATGCCACTGCAATCCAGCCTGGGTAACAGAGTGACACTCT 
GTCTTTCAAAAAAAAAAAAAAAAAAAAAAGAAGGTACCTGAGTAAGCAGGGCCTTAAACAAAGGGGGCATTTGGTTACAG 
AGGAATTGCCTGTTCCCAGCCCCAGGACTGGGTGAGGTTCTTATTTCTCTGTCCAACTTTTATGCTAGGATTTTTATCTT 
CAGCTTTTGATTTCAGGTGGGAAAGGGAGCGTCATTCATTTTTCAACAGACCCGGGGTGCTTTATTTTCATCTTTCCCGC 
TGGGCAGTCTTCTCTCTGTGGGTATGGGAAAGTGCAGTAACCACTCTGTGCCTCAGTTTCCTTGTCCGCCTTATCTGGCA 
CCAGAGTACCTACCTCACTGGGGAGGGCTCACGAAGCCTGGCTGGGGCGAGGACTCAGTAAACACTGGCCATCGCTATTT 
TCATTCCAGTTCTTGTGGCTCATAAAATGGCTCCTTACGTTTCTGTAGAACTCAAGCTCCTGCTCCAGCTTCCAGGACCA 
GAAGATCGCCAGCATGTTCGATCTGACTTCCGAGTACCGCCAGCAGCACTTCCTCACCGGGCTCCTCTTCACAGAACTGG 
CTGCTGCCCTGGATGCCGAAGGGGAAGGGTATGTTTCTGGCATTTAAAATGGAAGATGAAGCCAAAAAAACAGATGTTCT 
TTAATAAAATTTGCAGTCTAGCTTCTCACACTTGGTAAAAAACTCTACTGTAGTTGACCAGTTCTGAGGAGTAGAAACAT 
CTGTCTTGAGAATATGGTACCCATAAGCACAAGGCACAAGAAAGGCCTTTCTTGTGTAGAAAGGCACCAGGGATGGGTAA 
GAACTACAAAATGACTTTTCTTGGTCAACTATTTCAGTGGAATTTACCAGTTCTGCTATAGCAGGTTTCCCAAGGATGCT 
TTGATTAGTGAACTCCCTAGGAGCAAAGCCATTTTTAACAAAGGGGATAGCATGCAGAGGCAACCACAAGATGTCACTTG 
GTTCAAAGCTGATGAAGGAAATAATGGCTGCTGAGAAGGCAGCTGTCCCATGCCCAGATTAGGTTTCTTGCAGACAGTGC 
TTCTCAGGCCAAGAGACACCACACATCCAGGGCTTCTGAATCCTTTTACCTCTTATCTATCTGATTACTCAGGGACATGT 
GGCAGTATCTAGCCTAGAAGTCAACAGACAGAGAGGTAGACCACCCCCTTCTTTCCTTCTCTCCCTATGCTCCGTGAGCT 
CATGGAGTCAGAAACCCACAGCCTATCTGATTGGACTGAAAAAGATAATGCCTTCTAAAATATTATTCATTCCGTTCAAC 
AATTATTGAATGCCTTCCATGGGACAGACACAGTTAGGTGTTAGAGTACATCAGAGATATATCAGTGCACAAAACAGACA 
AAATTCGCTGCCCCTCAAGAGCTCACCATCCAGGGACTCAGATTCAAAGATGATTAGGATTGCAAAGTAGTTGAGATTCA 
ACACTTCTGGTGTTTAGATTAGGCATACAGAATTGCAGGAACTTGCCCACTATTCCATAACAGAGAACGAAATCTAAAGC 
CATGGTCTTCCCAGTCCCTTCCCAGGACCCACTCTCCTATACCAAACCATCTCCCTGCACAGGGAACCTGTTCCCATGAC 
TCCTATGCTAAAGAGGTTTCCAGAATAATGTTTGCTAATTAATGGTAGCATTGCTGTATGTTATGGATGGATTTTTATCT 
AAATGGAATTTCTCAAGTTCTCTCAAAGTGCTTACAATTTAGTGAAGGAAACAGGAGGTTCCTGCATGAAACAAATACAG 
GAGCACATCGAGTTCGTGCAACTGCAACAGTCACATGTGTCCCAAGCTATGTACCCAGGCTGACACCGCTGAAGTGACTC 
TATGTCTGCGTCTCCTGGCAGGATAGTGTGGGGAATCTCATGCTTTTAGCTCTCAATTCTGCCTCCTTCCAGATAAACTG 
GCCTGAGTATATCCTTTGAGAACTTCACTTTCCATGGCTTCAATTCTATCTTCTCTTCTCAACTCTAACTAAATTTCTCC 
CAATTATCTCAGTAGAATGTTCTACCCTACAACTAACATAAATTTCCACAGCAACAAAAAGTGCACCGAACACTTATGCT 
AATAAGTAAGATACTGAGAAGAAAGTTTGAACACAAAGAAAATTGCCTTCATGCACAAACATGTACATACATTTTCTTAG 
TTGTCCTTTAATAGCAGTACTTTAAGTGATTTCTAGAAACATCTTTAGTATTTACAATAGCGTAGTTTCTATTTTCTATT 
TTCATTCTAGCTGGAAACAGCCATGACATTCTGTTCTGGATTCCTTGTAAAATTGTTGCTGTTATATTACTAGCAACAAG 
GTAGAGTATATTCAGAGATAATCATGTAATTATGTTTAATCAGGTAGATACATTCTTCAAACACACACACACACACACAC 
ACACACGCACACACGCGCGTGCACATGCACACACACAGTACCCTTTCTCCCCCAAAAGAGTAATTAACAGTGTAACTCCT 
TTGGCAACACAGTAATCCCTGATTGCTGGGTTGTCAGTTACTCTCCTGGAAAGTCATTAGATACACTGTCACAACTTCAT 
CATGTAAGCTAAAACCTCAGAAAAATGTCATGCTTCAACAATACTTCCTATTGATCAGCACTTCCTTTTTTTTTTTCTTT 
TGCTTCCAAGACAGTTGGTAGACTTGCCAACACCTGTACATCCATGGGGCAAGGGCCAGAAGGACGCATCTCAGTACCTG 
AACCCCTAGGGAGCTACAGCTCCAGTTTCCACTGGGTTTCCCTTGGATCTCTGACACCTGGGTATATCCACAAGTGTCTA 
TGCATCGTTAAATCCACAGTGTTTTTCAAGCATTTAGAAATTGTCTATCATCTTTGTGTTAAGCCAGCAAATTGCAATGC 
CTAATTCAAAACACAACCCATTGCCGGGACCACAGACTAAGAACAAGAAAAACTTTTGAAATGCAATTTACAATTATCTT 
ACTTTAGCCACAGTGCAAGAGTCTGAGTCATTTAAAATTTTGGTTAATATTTTCTATATAACGTTGAATTCTGATGTAGC 
CTTATTTTGTTTGAAAGAAAAAAATGTATATATATGTATATATTTTTTAAATCTGGATTCTTTTTTTATTATTATCATAC 
TTTAAGTTCTAGGGTACATGTATACAACATGCAGGTTTGTTACATATGTATATATGCGCCATGTTGGTGTGCTGCACCCA 
TTAACTCGTCATTTACATTAGGTATATCTCCTAATGCTGTCCCTCCCCACTCCCCCCACCCCATGACAGTCCCCGGTGTG 
TGATGTTCCCACCTATGAGTGAGAACATGCGGTGTTTGGTTTTCTGTCCTTGCGATAGTTTGCTGAGAATGACGGTTTTC 
AGCTTCATCCATGTCCCTACAAAGGACATGAACTCATCCTTTTTTATGGCTGCATAGTATTCCATGGTGTATATGTGCCA 
CATTTTCTTAATCCAGTCTACCATTGATGGACATTTGGGTTGGTTCCAAGTCTTTGCTATTGTGAATAGTGCCGCAATAA 
AAGGATTATAAATCATGCTGCTATAAAGACATATGCACACATATGTTTATTGCGGCAAGATGTATATTTTTAAAATTGAA 
CATCTATGACTAGGTTCCAGATACTCCACTATATATTTTATTTACTTTTTATTATAGTATAATTTATATAAAGAACCTGA 
ATCCTAAGTGTAAAAGTGCGATTAATATTTCCTGTGAATATATTTGAGTAACCACCAGTCAGATAAAAAATATAATATTC 
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CATTATCCTCTCTCAGTTACAGGCCCCCCTCAAGTAACCACTATTCTGACTCTTATTATTAGAAATTAATACTGCCTGTT 

GTTGAATTTCATAGTCTTTTGCTGAATGTTGTGACTGTGACATCCTTTAGTGTGGTCGCATGTGTCAGTATTCATTTTTT 

TTCAACTGTATGTCATATTCCTTTGTCTACTATAATTTCTCTTCTGTAAATTGACATTTGGGCTGCTTTCTATTTGTGGG 

TATTGGGTATTATGAAAACAGCTGCCGTGAACATGCCTGTGCATGGTTTTGGGTGGACGTTAGAACTCATTTCTTTGGGG 

CTATAAATACAGCCTATTTTTTATTTTAATATACTGCTCTTGAATAGTTTAATAAATATGTGTACATGGTCTTAACAAAA 

TGTCAAAAGAATATACTCTGAGCTAGGAAAAGAAGAGCAAACAAGTCAAAGCAGGAAGATGGCAGGGAATAACAAAGGTG 

ATAGCCAAAATAAATGAAATAAAGAATAGAAAAACAATCACGGAAATCCGCAAGATGAAAAGCTTGTTATTTGAAAAGAG 

CAACAAAATTCACCAATCTTTAGCTGAGCTGACCAAGAAAAAAGGAAGAAGACTCAATTACTAAAATCATAATTGAAAGA 

TTCAACACAATCATATCACAAGAGACCTTACAGAAATAAAAAGGATTATAAAAGAATACGATGAACAATTGAAAGCCATC 

AAATTGATAACCTAGATTAAATGGATAAATTCCTTAAAAGGTACAAAGTACTAAAATTGACTCCAAGAAGATATAGAAAA 

TCCAAATAGACCTACAGAAGTAAAAAGATTGAGTTAGTAATCAAACTTCCCACATACACCTACTATGTACCCACACAAAT 

TAAAAATTTAGGCTGGGCGCAGTGGCTCATACCTGTAATCCCAGCACTTTGGGTGGCCAAGGCGAGTGGATCACCTGAGG 

TCAAGAGTTCAAGACCAGCCTGGCCAAGGTGATGAAACCCCGTCTCTACTAAAAATACAAAAATTAGCTGGGTGTGGTGG 

CGGGAACCTGTAATCCCATCTACTCGGGAGGCTGAGGCAGAAGAATCATTTGAGACTGGAAGGCAGAGGTTGCAGTGAGC 

CAAGATCATGCCAATGCACTCCAGCCTGGGCAACAAGAGCAAAACTCCATCACAAATAATAATAATAATAATAATATATT 

TTAAAATTTAAAACTTCCTACAATAAAAGCTCAAACCTGGGGGGCTTTACTGATGAATTCTACCAAATATTTTTAAAAGA 

ATTAATTCTAATTTTTTACCAACTTCCAGTCTTCTCTTCCAACGAATGGAAGAGGTGGAATACTTCCCCACTTGTTCTAT 

GAAGCTAGCATTACCCTATACTAAACCAGACAAAGACATCATGAGAAAACTACAGGCCAGTATCTGATGAATATAGATGT 

AAGACCCTCAACAAACACTAGCAAACTGAATCCAACAGCATATAAAAAGGATTATACACCATGGCTAAGTAGGATTTATC 

TCAGGAATGCAAGATAGGCTGCATACCTGAAAATCAATTGTTGTACCATATTAATAAAATAAAGGACAAAACCCATACAA 

TCATCTTAGTAGATGCAAAGAAAAGCATTTAATAAAATCTAATAACGCTTCCTGATAAAAACACTCAACAAACCTTTTAG 

GAAATAAGAGAACTTCCTCAACTTGACTTAAGGGCCTCTATGAAAAATCCACAGCTAATGTGACACTTATTAGTGAAAAA 

CAGTGCTTTATCCCTAAGATTAGGAACAAGACAAAAATGTCTACCCTTGCCACTTCTATTCAACATATTAGGAGTTCTAT 

CTAGGGCAATTAGGCAAAAAAATAAAACAAAAGACATCTAGGCCAGGCGTGGTGGCTCACGCCTGTAATCCCAGCACTTT 

GGGAGGCCAAAGTGGACAGATCGCTTTGAGCCCAGGAGACTGAGAACACCCTGAGCAACATGGCAAAACGCCATCTCTAC 

AAGAAATACAAAAATTAGCTGGGCATTGGTGGCTTGTGTTTGTAGTCCCAGCTACTTGGGAGGTTGAGGCTGGAGAATTG 

CTTGATCCCAGAAAGCGGAGGTTGTAGTGAGCTGAGATCACGCTACTGCACTCCAGCCTGGGCCACAGAGTAAGACCCTG 

TCTCAAAAAAAAAAAAAAAAAGAAAAAGAAGAAAAGAAAAGAAAGCATTTAAATTGGAAAAGAAGTAAAACTATCTCTAT 

TCATAGGTGGCATAATCTTGTTTATAGAAAACCATAAGGAATCCACAAAAAACTCCATTACAACTAATAAATGAATTCAG 

CAGTGTTGCATGGTATAAGATCAACATACAAGAATCAATTGTGTTTCTATACACTTACGATGAGCAATCTGAAAATGAAA 

TTAAGAAAACAATTTCATATAAAATAGCATCACAAAGAAAAAATATTTAGGAATAAATGTAACAAAAGAAACACAAGAGT 

TATACACTAAAAATGACAAAACACTGTTGAAAGAAAGATATAAATAAATGGAGGATATCATATGTTCATGAATCAGAAGA 

CTTATTATTAAAATAGCAATACTCCCCAAATTGATCCATAGATTAAATGCAGTTCTTCTCAGAATTCTAGCTTGCTTTTT 

TTTTTTTTTGGCAGAAATTAGCAAACTGATCCTAAAATTCGTGTGGAAATTCAAGGGACCCAGTATAGCCAAAACAACCT 

TGAAAAACAAGAACAAAATTGGAGGACTCACACTTCCCAATTTCAAAACTTACTACAAAGCAAAAGTAGTCAAGACTATG 

GGGTTCTGACATATGATAGACATATAGATCAATGAAATTGGGTTAAGAGTCCAAAAATAAATCTTCATATTTATAGTCAA 

TTGATTTTTGACAAGAGTGCCAAAACAATTCAATGGGGGAAAATAGAATTTTCAATAAATGGTGTTGGGACAACTGGGTA 

TCCACACTCAAAAGAATGAAGTTGGACCCTATATTACACTGTATACAAAAACTAACTCAAATAGATCAAAGACCTAAATG 

TAAGAGCTAAAACTATAAAATTGTTACATAAAATTATAGAGGTAATCATCATAGACTTAGAAAAGGCAGTGGTTTCTTAG 

ATATGACACACTCGAAAGTATGAGTAACAAGAAAAAAATAGATAACTGGACTTCAGTAAAATTAAAACTTTTGTGATTTA 

TAGGACACCATCAAAAT^ATGAAAAGGCAACACACAAAATGGGAGAAAATATTTGCAAATCAAAAACCTAATAGGGGACT 

TGTATCTGGAATATATATTTTAAAATCTTACAACTCAGTAATAAAAAGACAAATAACTCAGTTTTTTAAAAGGCAAAAGA 

TCAGAATAGACATTTCTCCAAAGAAGATACAGCCATAAGACCATGAAGATGTTCAGCATCATTAGCCGTCAGGGAGATGC 

ATATTAAAATCACAATTAAATACCACTTGATACCCACGAAGATGGATATAATAAAAAAGACAGGTAATAAAGTGTTGGCA 

AGAATAAAATGGAGTCTTCAGACACTGCTGGTGGGAATGTAAAATTGTGCAGCCACCGTTGAAAACAACTTGCTGATTCC 

TCTAAAAGTTAAACAGAGGCTGGGCGCTCGGCGGCTCACGCCTATAATCTCAGCACTTTGGGAGGCTGAGGTGGGCAGAT 

CATTTGAGGCCAGGAGTTCGAGACCAGCCTGGCCAAGATGGTGAAACCCTGTCTCTACTAAAAATACAAAAATTAGCCAG 

GTGTGGTGGCAGGTGCCTGTAGTCCCCGCTACTTGGGAGGCTGAGTCAGAAGAATTGCTTGAACCCAGGAGGTGGAGGTT 

GCAGTGAGCCGAGATCGTGTCATTGCACTCCAGCCTGAACAACTCCATGTCAAAAAAAAAAAAAAGTTAAACAGACAGTT 

ACCATACAAGCCAGCAAATGTACTCTGAGGTATGTACCCAAGAAAAGTAAAACTTAAAACTTGTATACACATACTCATAG 

CAGCGTTGGTAAGTCACAATAGCTCAAAAGCAGAAACAATCCAAATGTTTATCAGTTGATGAATGGATAAAATTCACCAA 

TGGAATATTATTTAGCAATAAAAAGGAATGAAGTACTGATGCTACAATATGATAAACTTAAAAACATCATGCTAAACAGC 

AGACCCAGGGTTAGAACACAGGCAGTCTGTTTCTGGATTCTATGTAGTTCTATCCTATATTGCTTGGTTTTCCAGGGTTA 

GAACACAGGCAGTCTGTTTCTGGATTCTATATAGTTCTATCCTGTATTGCTTGGTTTTCCAGGGTTAGAACACAGGCAGT 

CTGTTTCTGGATTCTATGTAGTTCTATCCTGTATTGCTTGGTTTTCCAGGGTTAGAACACAGGCAGTCTGTTTCTGGATT 

CTATGTAGTTCTATCCTATATTGCTTGGTTTTCACAGTCACCTCATTGCTTAGGAGCGTTTTCATCACTCTTGACTGTTT 

AAGAGCTCTTTAGTCAATTTCCTTTCACCATAACCTCTTGATTCCTGTGTTGTGCCAACAGAATCAGCAAAGTACAAAGG 
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AAAGCTGTCAGTGCAATTCACAGCCTGCTAAGTTCTCACGACCTGGACCCACGCTGTGTCAAACCAGAGGTGAAGGTCAA 

AATCGCCGCCCTTTACCTACCTTTAGTTGGCATCATTTTGGATGCTTTGCCACAGCTCTGTGACTTTACAGGTAATGGCC 

CTTCTGTTTTCTTTCTTGGATTGTTGGGGGCCCCTGCCAAATGCCCCATCCGAATGAGATCTCTGTCATTCGTTCCAGTG 

CTGATATGTTCATATGAGCAATTCTTCACAAAAATGGTCTCCAAACCTCTTTAACACTGGCCCCAATCAGTTAAAAAAAC 

AAAAAAAGTCACATAGCCCCATATGTGTTTTTTGTTTAGAAATTATTTACTTGTTTCTTCATTCATTCATTTATTCATTC 

ATTGATCTGCTAGTATGGAATGAACTTTATAAACCATACACCACAAAGGATGAAATAATAACAAGATTACTGCCTTTGGA 

GATCGTGCTCTAACACCTTGAAATAAAGGTGTTCCTTCCCTTTTCTTTTATGAATATGTAAGCAAAAAAGCGGTCTCAAA 

TGAACCCAACTAATTGCTCCATGCAAAAAAAAAATCATGGCATATGTCAATATTAGAAAATTCTATCATAGCAGAGAAAT 

ATTGCCCTTGGCACTAAGTGACTAATTTTGAGGTTAATATGTCTAGATTAGCAACAAGATGAAAGGATAAGTTGTCTTAA 

AGGGGTTTTGTGAAACCCCAGAATCTATTTACAAATTACATTGTGGATTAAGGAATGTAGAGGGAAAAAAATCCTCCCAA 

ATAGAAGAACTTCTTATGTTGACCTGAGAAAGGCAGTCCATTGGCCTACCCTCTCCACAGCGTACGTGTGTGTGCACGTG 

TGTGCGCGCGTGCACACACACACACACACAATTCCTATCCCATCAGAGTAGTTCTTGCTTTTCCTCCAGCTCAAGGGAAG 

TTTTGGAAATGTCCATGCTGCTCTCTTGGCAGCTGAAGGGTGCTGGGATCTGGGCAATGCTTCCCTGAGCCAGCCTCTTT 

ATAGTCAGTCATCCTTGAAATCTTGGAACCATCCAGGTTTTCCCACCTCCTCCAAATGGAAAGTGTCATGTGTGACTAAA 

TTGTTTATAATACTATTACGGTGACAAGTATTTAGTTCATGAAAATGGGACCAAACGCCTATAGTCTTTAATTCATACAA 

ATCATAAAATGAAAAGCAAGTATTCCTGTGGAAGTAATTCTCTCCTTTCTTGAAAAAAATCCTCTAGCAAACTGCTAATG 

GAGTAACTCAGAAGAGAAAGGTTGAGACAGGGGTTAGGGTTTGTAGAGTTCTGTTGGGGCCAGAGTGCAAGGAAATGATA 

GAAACAAGCTCAGCACACAGGGCTGAGCAAGCTGCACCATGAGGTCAGCAGCTTCCTCTAGCAGCGCCTGATTGCGCGGA 

ATTGAAAATGGAGTTGTTTTTAACATTTACAGACATAATGCAGAGCATGGCATGTGACTTGTAGCCCATTTTGAGAATCC 

TTGATGGCAAGTTTCTAAAAAGTTCCATTTCAAGAGCTTGCTTGTTCCCAAAGCCAGGAACCACTTTAGCACACATTATC 

CGAAGTTTTCTTGTCAATTAGAATATTCCGTATCAGTGACTCGGAATCAGAACTTTTCAACATTTGGTCTCCAAGCCTTT 

TAAACCTCAAAGACTTCTTTGTATATGCGTTGAACTTTATGATCATGATGTAGCTGATGAGAGAAAAAAACAGTGAAAAG 

TATTTTATTGTTATAGGTTACACAGTACATCTATTAAATATGGAATCTTTAGGTTGATAAACTCATAAGAATACAGTCTT 

TCAAAAAGATGCATCTGAATTCAGATTCCAGCCCCATTTATTAATATAAGTGACCTTTGAAAGGCTCAACCTTTCCGTGC 

CTTCCTTTCCTCTGTAAACTATAAAAATGTGATAACGATGTCTACCTTTTAGGTTTACGATAAAGGTGAAACTAGAGAAC 

TTCTATAAAAGCATTTTGCACAGCACATGTTTGTATCTTTCCCGATTTTTCTTGTAACTATAACCCTATGGCAATTAAGG 

GGAAATAAGAATGTGTCTCTATGTTAGTTGTGATAATGTTATCAGGTCTTGCATAATTTCCATGTGCTGTTTATTTAACC 

ATTTGTTTAAAATCCCAATGGCCTTAATAAGTATTTCTAAAAATTGGGTTTTGGCTCGGTGTGGTGGCTCACCCCTGTAA 

TCCCAGCACTTTGGGAGGCCAAGGTGGACAGATCACTTGAGGCCAGGGGTTCAAGACCAGCCTGGCCAACATGGCGAAAC 

CCTGTCTCCACTAAAAATACAAAAGTTAGCCAGGCGTGGTGGCGCACCTGTAATCCCAGCTACTCGGGAGGCTGAGGTAG 

GAGGATGGAGGGGCAGAGATTGCAGTGAGCGGAGATGGCGCCACTGCACTCCAGCCTGGATGATAGTGTGAGACTTTGTC 

TAAAAATAAATAAATAAATAAATAATAAGAACTGGGTTTTGTTTTCACAACTTTAGTAGAAGAAATGTTATTTACACTCA 

AATTTTTCTAAATAATTGAAGGCCCAGATGGCTGTAATGTCAACAGGTCTAGAAAAACATGAATTTTTAGGAAAACATGA 

GTGAATCAATCAGTTGTGAATGTTTTACCACACTTTTCCCAGAACTGCTTGAAATATTAATAGTTTTTGATTGTATACTG 

GTAGCATTCTTAAAAACCAGACTTTTAAACGGTTTATCGTTTTTTTGCTTGCATACATCTAGACCTATCTATAAATATGG 

GTATGTCTATATTACATAGATACAATAAATTGACATTCATTTTAAACATTTGAAATATGACAAATTATTGCTGAACTTGT 

GATAATAGCTTATTATTCTATATAAGGTAGTTGCTTAATTCTGTAATTGTAGGTGTCTTCTATTTGGTCATTATTTAAAA 

TAATGCCAATTATTAGAATAGAGAATGAAGTTTAAAAAATTATGTTACAGGAAACAATTATGGAAGGTTTGAAAACTTTT 

TGTTCACACAATTTGAAAAATTAATTTCTAGCCTAATCTTGTGCTAGACATTGTCTCTTAGCCTGCTGTGTTTTCCTATA 

GGTGATAGCAGATACATAATGCTAAACATCAGGTTTGAAATTACTGTGCTGACTTTAGTGACTGAGAAGTATCAGTCTCT 

TATTGGGTAGGGGACATGGGGAAATGTCATGTTTGACTTGACATCACAAACGATGTTTTCATTGCAGTTGCAGATACTCG 

CAGATACCGCACCAGTGGCTCGGATGAAGAACAAGAAGGAGCCGGTGCCATTAACCAGAATGTGGCTCTGGCCATAGCAG 

GGAATAATTTCAATTTGAAAACAAGTGGAATAGTGCTGTCTTCCTTGGTATGTTGGTGCACATGTGTCTGGTTGATTTTT 

CATTTCATGTCTTCTGTCTTCTGTTTTGTTTTGTTTGTTTGTTTGTTTTGAGACAGAGTCTCACTCTGTTGCACAGGCTG 

GAGTGCAGTGGCGCAATCTCAGTTCACTGCAACCTCCGCCTCCCGGGTTCTTCATGCCTCAGCCTCCTGATTAGCTGGGA 

TTACAGGCGTGCACCACCACACCTGGCTAATTTTTTTTGTATCTTTTAGTAGAGACGGGGTTTCGCCATGTTGGCCAGGC 

TGGTCTCAAACTCCCAACCTCAGATGGTACACCCACCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACTGTG 

CCCAGCCTCTTCTGTCATCTTCTGATGGGAGAACTCAGCTTAGAGCAGGCATTGATTATTTCCGCCTCATTTTGCTGGAA 

AGAATCATTGTGAGGCTGGCAGCTGAGGTGCACACAAGTCAGAAAGGGTTCTGGCAACAGCAAGAGATGTGCAGATCAAA 

TTCAGGCAGTCCCAGTGGTGCTGAGTTCAAGTTCAAGAAACCGGAAAAGAGGAAAACTTGGGAAATCTGAAATCCTTGCA 

AAGGAGATCAAACCAAGGCTCTCTGAGGGTGGAGGCTTAGGCTGTGTTCAGTCTATGTGTGTGTAATTCATACAGTGTAA 

TGCCTCCAAAAAAAAAGACTGAAGGAGGCTGGTAGCTTAGCGTAAGGGCTTCTTAAACAAGACTGCAGCTTTTCTAGGAC 

CCCAGGGAGCCTAAGGAGGGTTGTTTGGCTCTGTCCTTTTGCATGGACTCTGTGGGCTCAAAGAGGGGCTCCTCTGGGAA 

TGTATCCTCTGATTCAGTTGAGGTGGGATTCATCTCTGTTCATTGATGAGCTCTGACTATCCCTTGTGAAGACAGCACTG 

CATCCTCCAATGTCATTTACTTTCACATTTCAATGTAGGGTATAGATGTGCAGAGAATTTAACTTTCTTCTCCCGTGAGA 

AAGCAGCATCCTGACCTAACTCTTCTTGCTGGATGACTTTCTTGTCTTCCTTCCTGCACTTAGCAAGCTCCATCACCATG 

AGGATTGGGGGGACAGTGAGGCAGGAAGATGAGACAGAGAAAAGACATTTGAGATTTGACATCACCGATCTTACTGTTGG 
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AGAAATATTGTGTGACCTCCTCCACCTTTCTAATATCCGTCTTACCAAAGTTAGCTGCTTGAGGTGGTATATGCCTTTTA 

CATTATTTGCCTCTAAGGGAAAAACTCAAAAGCCCAAAGTTCACCTGTTAGAACATAGTCCTTGTGAGGTTGTATCTCAA 

GATTTCCTTTATTCTTGTCAACAAACTCAGAATAACTAAAGTTAAAGTTGCTTTATAACCTCTATTATTTCCATCCAAAA 

CTAACATTTCCCTCCCATACCCACAAATTCCTCATTGCAATTTCACCAGTACATTCCACTGGATATTAGCTACGCTGCAT 

GAACCAAGGGTGCAGCCTCATTGTTGTTGTTTGTGTTGATGAGATGAGTAACAGAGAGTGCGCTCAGCACTTTAAAATGA 

ATACTTGTGGCCAAAATATTAAAGCAAATAGCCTGAACCCCCACACCCCAGCCCCAGGCAAACATAAATTATGGTTAAAC 

TTCCATTACAGAGAACTCCACAAACATGGATTTGATTAATTTGCTGAGCAGTTCATGACCCTTATGTTATACTTTGCACT 

ATGTAACAAAGAAGCTAAATCTTGAACAGCAAGCTTATTCATGGCACCCAGCCAGTTTCCTTCTCCCCTGTAAGTCTAAC 

CCTCTGCTGCCCAGCACGAAGGAGAAATACTTGGAGATCTTAGCAGCATGAAAGCCTCTTTGTATCACTGGGATTGCAGC 

ACGCATGATCAAGGCCCAGGGGTGATCACCAGGCCACACTGCTCCTAAGACAGAGGTACTCAGATACGTGGCTGAAAGCC 

TAGCTCAAATACTGCCCCAGGTGAGCCTCTTTGCTGAGTAGCGCTACTCAAAGAACACAGCTTCCCCTGCCTAGGGAAGG 

CAGGGCTACTGGCAATAGATCTCCAGCCTAGCAGTGATGTACAGTCATGGTATTTTAAGAGAACACTTTGAATTTTTCTG 

TTGCTTGACTGTTAAGCCTCAAATTTTTCTGTTGCTTGACTTCTTCCCTGGCCTCCATCCCCCAATCTGCCTCCCTTCAG 

CCCTATAAGCAGTACAACATGCTGAACGCGGACACTACTCGCAACCTCATGATCTGCTTCCTCTGGATCATGAAAAATGC 

TGATCAGAGCCTCATTAGGAAGTGGATTGCTGACCTGCCATCAACGCAGCTCAACAGGATTTTAGATCTACTTTTCATCT 

GTGTGTTATGTTTTGAGTATAAGGTAAGTCTGGAGTGGCACAACTTTATACCAGCTCTTATCTCTCAATTGCAATTCTGT 

CTTCTTACTCATCCCCTTTGTTTGGGCCATGGAGGCATCATTAATTTTTCTCATTTCTGTATTCAAATCCATAACCCATT 

TGTAGGTATAGATATGATCATTTCACAGGGAAAGGATCTCTGCCTTCTGCAGAGAGAACCCCATTTCTGTTGACAGAGTT 

TTGGCCCATAGGATGCTCCAGAGCAGCATCTCAGTGAAGCACATGTCAAACTTAGCTGGCATCACTGTGGAGTGTACTGT 

TTTGGTAACTCTCCCCATCAACGGAGATCTCACCAAAGGACATGTCCTCCTACCTCTGTCTTGTCCAGGGAAAACAGAGT 

TCTGACAAAGTCAGTACCCAAGTCCTGCAGAAGTCAAGGGATGTCAAGGCCCGGCTGGAAGAGGCTTTGCTCCGTGGGGA 

AGGGGCCAGAGGGGAGATGATGCGCCGCCGGGCTCCAGGTGTGTTGGACTGGCCCTTCCCTGCTCTCTGTCAAGCAGTTT 

TTCACTGTTTGTGGGGAGGAATGTCCTCCCAACATGATTAGACACCATTACTTTCTTGAGATATTTACGGTAGTGTCAGA 

GACAGCGGATTCTGGGAGTCTGTGTGTGACATTTGTGTTAGCCCTGTGCCTGTGAGGGAAAGCGCTGTTCTACAATTGTA 

GGAACCCTGAAAACAGCACCAAATGACATCTTTTAATTAAAACTCGTTGATGGTAAAAGGTCACCTCTAAGAATGTCAGT 

CATGGATGAGGTGGAAGGTGTTTCCCTCAAGGCTGAGGCCTTTCTATGTGACCTTTGGCTAGTCTATCCGGAGCATGGTC 

AAGAAACTGAACAGTTCTGGCCCCAAGATCATCTTCTCGATCATTCAGGGCACTATTAGAATGGAACCAGTTCCCTGAAA 

TGGCTTCAGAGTGTCCCAGTCATTCAAGGATTCCACCACAGGAGGAATGATCTCAAAAAGGCTGAGCTTGAATAGAATGA 

AATCCCCAGGTACCCTCAGTCTTATTCACCATGCTCAAAGTAAAACAGAGTGACAGCTTATTGTATTCGAAGGGACACAG 

TGGCAGGGAACTTGGAGGGAGCTCATAGTTTTCAGTGGTGGTCAGGCACCCTCATTTGACACCCATACTTTCATACCCAA 

TAATTCAGTAAGCCCCCAGAGTTTCACAGGAATCCTCTGCTCATGAGAAATGTCTCTCTGACACTCAGAAAGGCAGAGGT 

TCTTCTTATATTCTAGTCTATCAATCAAAGAGCGGGCCAACTTGGACATAGGTGTGGCGACTTTGTCTCCTACCAGCAAC 

CTGCATGGACTCTAATTAGCCCGAGAAATGGTGCTGAGGCTTCTCAGTTGAGCTTGTTATGAACTTCTGGTTATCTTGGA 

GGGTTTCATGCTAATCAAATTCCTATCATGCATTTCTTAACTCCTAGGGAACGACCGATTTCCAGGCCTAAATGAAAATT 

TGAGATGGAAGAAAGAGCAGACACATTGGCGGCAAGCTAATGAGAAGCTAGATAAGTGAGTCACTCGGCAACTTTCTGCT 

ACTTTTACCTAAAGTCCAAAACTATTTTTCCCAGGCTGCTTGTATTACTGAAACAACTGCATCCTTCCAAGGGTTAGAAA 

ATGAAACATCATTATCTGTGTAAATACAATTCATCCAGGGACCCAGGATAATCAAAGGTATAGGGAGTTGTGGTTTCCAG 

CTCTTAAAAATTTGTCACATTGATAAGCATGTAAAGAAAACTAATATTTCTTTAGCAACCTCAGATGGCTTAATAAAAGC 

AGCTGATTTTGCAGGGAGGGTAGCAGGGAAATAGAGAAAGCAGGACACGGTGCCTAGGACCGTATACTTTCAAATCGATA 

TTTCCTTTCTGGAAATATGTACAAGATATACATTCAGATATATTTATGTCAGTGCTACTTAAAGTTGTTTTTTAAAATTG 

AAAACATTCTAAATGCTCCAGAATAGAAAAATATATTTAAAAGTTGGATTGCCATCAAAATGTTTTAAAATCATCTTAAT 

GACATGGGGAAATGCTTATGACATAATGTTAAATGAACAAAGGCAGGGTAGACACTTGGTTTTCTAGTAGTGTGTGGCTC 

ACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCCGGTGGAACACCTGAGGTCAGTAGTTCGAGACTAGCCTGGCCAAC 

ATGGTGAAACCCCGTCTCTACAAAAAATAGAAAAATTAGCTGGGCATGATGGCAGATGCCTGTAATCCCAGCTACTCAGG 

AGGCTGAGGAGGAAGAGTCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCAAGATCGTGCCATTGCACTCCAGCCTG 

GGTGACAGAGCAAGACTCCATCTCAAAAAAAAAAAAAAAAAAAGAACCTCAAATCTCTGATGACATATAGTCAATGTATT 

TTTTAAAGGCTAGAAGGAAATTTACCAAAATATTAATTGCGGTTTTCTTTGGGTGCTGGGAATATCGGTAATTGATGTTT 

TCTCTTTTATACTTTGCTATGTTTCCTACAGTAGATGTGTATTATAATATATATTGTTACCAGAAAAAAAATTATAAAGA 

AATGAAAGAATAAAAAAACCTGTTTGGATCAAAATCTATGCTATAGATGAATAGATATATTGACGTGTATTGCTATATAT 

GCAACAATTTAGACATATGAAAGCTAAAAGTATATATGTTTGACTAGATAGCACATGTTTCCGATACCGTTTAGCCACAG 

GTAGCCTAATGAGTCCCAGGGTTTGCTGAGCAAAATGTCGTAGAGCTAAAATCACCCCTCACACACAGTTGACAAGGTAG 

GATACCCCATGGGTGATGAAGCACCCATCAAAGGGGGTATGGGGCAGACTTGATTAGCAGTGCTGTTTTGTAAATAAATG 

TGAGTTAATATCTTTATTGTTACCACAAGAATGAATCATTCTGGTTAACTTATGGGACTTTGGGGAGTCACATTTATTTT 

AAAACATTAAAGAAATAAGTATATTTTTATTATAGATGTGACTTGGAAACCCAGTTGTTGGTATCATGCCTATTTGCTTT 

GGGTATGTTATTCTTGCACTAAATTCCCTGGACTGATTTATAAAATACTTGGATCGGAGAGTCCATACATTCTACTTCGG 

GCAATGAACAACAACTACAAAATATGCATGCCGTGGGTGCTACAGAATTGCTTAGGTGGTTTAGCTCATCTACTCTACTA 

GATTAAAAGAAGACCCCTAAGTAGGCCTGTTTTGCCTGGGCTGTGTAAACTGGTGTTTTGGAAAACCTTTCCCTTTTGCT 
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TGCAGGAGCAACACAATATTGGTCCTTACCACGCATGGCCTACCGGTTTGCTCTAGGTTAGTTCTCAGCCCTGCGCTACC 

TGGTGGGCTGCTGGTTGTCAGAGCTAATGATTCACATCTACAAATTCTCAGGGGCTTTGTGCTTTTTTTTTTTTTTTAAA 

AGGGAAACCATAGTAAAATTATAACTGTTCAACCTACCTACCTGGAAAATTTATAATTATAATAATGGTGTCTTTCACCC 

TCTTTCTGATCATTTTTCGATTATTTTGGCTCTGTTTCTTTTAGAGCAATGATCTTCAACTTAGGCTGCACATTAGAATC 

ACCTGGGGAGCTTTAAAACCTGTCAGTGCCCAGCTGCACCCTAGACTACTTCAATTGGAAACTCCAGCAGGACCCAGAAA 

TCAGTATTTGGTAAAACTTCCCAGGTGATTCTATTGCCACCAAATCCATTGTTTTAAAGGAATAGATGGTAAGCTTTTCT 

TTTCTTTTTTTTTTCTTTTCTTTTTTTGAGACAAGGTCTCACTCTGTCACCCAGGCTGGAATACAGTGGCACAATCACAG 

CTCACTGCAGCCTTACCCTCCTGGCCTCAAACAATCCTCCCACCTCAGCCTCCCAAGTAGATGGGACTACAGGTGTGTAC 

CACCACAACTGCTTACTTTTTGTATTTTTTGTAGAGACAGGGTTTCACCATGTTGCCCAGGCTGGTCT7VAAACTCCTGGG 

CTCAAGTGATCCACCTGTTTTAACCTCCCAAAGTGGTGGGATTACAGGCATGAGCCACTGCGCCTGGTCAGATGGTAAAG 

CTTTTAAAAAACCAGATTAGTGTTAGTGATGGTTGCACAATATGAATGTACTTAACACTACTGAACTGTATACTTACAAA 

TGGTTAAGATGGTAAATTTTTATGTTAGGTGTACTTTATCACAATACAAAAATTTGGGAAAAAACAGATTAGGACACTCT 

AGATTTGTGCTGTCCAATACAGTAGCCATTAGCCACATGTGACTATCAAATGCTTGAAATATGGCTAGTTCAAATTGAGA 

TAATAAATCAAGATAAGTGAAAAATACGCACCAGATTTTGAAGGCTTATTGTGAATAAAAGAATAAAATATTTCACTAGT 

AATTTTTATATTGCTTACATGGAGACAGTATTTTTTATCTTTTAGGATAAACAAAATATATTAAAAATAATTTCACTTTT 

TCTTTACTTTTTTAATGTGGTTACCACAATATATAAAATGACATATGTGGCCCCCATTGTTTCTACTAGACAGCATTGCT 

CTAAATTTAAAACTAAACTAGCAGTCAATTAATTAAAAAGGATGATAAGGGGCCGGGCACAGTGGCTCACACCTGTAATC 

CCAGCACTTTGGGAGGCCGAGGTAAGTGGATCACGAGGTCAGGAGATCGAGACCATCCTGGCTAACGCGGCGAAACCCCG 

TCTCTACTAAAAATACAAAAAAGTAGCCGGGCGCGGTGGCGGGCGCCTGTAGTCCCAGCTACTCGGGAGGCTGAAGCAGG 

AGAATGGCGTGAACCCGGGAAGTGGAGCTTGCGGTGAGCCAAGATTGCGCCACTGCACTCTGCACTCCAGCCTGGGCGAC 

AGAGGGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAGGATGATGAGGTTAAAATGGTAAATTTGATGTTATGTGTACT 

TTATCACGATATAAAAATTTGATGGCTCATGCCTGTGGTCCCAGATACTCAGGAGGCTAAGGCAGAGCATCACTTGAGCC 

CAGGAGTTCGAGGCTTCAGTGAGCTATGATAGTGCCACTGCACTCTAGCCTGGGTGATAGAGCAAGACCCTGTCTCTAAG 

GAAAAAAAAAAAAAAAAACTTTTAGATTTCATTTATTTTACACATATATTATCACTTGGAAAATGAGAAAAAGTGTCAAG 

TGGCTTGGGACCAGAGAGCCTATCCTAAACATGAAAACAAGTAAAACACACAGAAGTACTTATTTTTTGAGTCCTCAGTG 

GTATGTAAGCAGCTGCAGTGCCCCCATTATTAGGTTAATGGGACGCAAGAACAGGTAAGTGGTAACCCTGGCCCAGGACA 

TATGAGCTGATATAATGATACCCCAACCCCATGGTAACATCTTGGCTACTGAGGCATCTTGGTAAAGTCAATTCTTCATA 

CCTCCCTTTCCTTGCAACTAGATTTGGATGATGATACAAAATATCCCTTTACAGCTTCACTTAGATTTCATAAGAATGGA 

TGGGCTAGCAAAAAAAACCATTCTGATTCCTTAAAGTGAAACCTAATAGAGGAGCGAGCACCAGGCAATTTCCCATTCCC 

TAGTGGGCAATGACCAGTAATGTCCGGCAGGATATTAGATCACCTGCCCTACAGGAATACAGTCTTGTTTCCCAATGGAA 

AAGGACGAAAGACCCCACGCACTTGGCTGAGCAACCTCAAGGTGATCTTTGGGAAGTTAAGAGGCTGACTCTCCCCTGAC 

TTGGCTCTGAAGCTCCACCCTTTCCTAACCCAGCCGCATCACTGCCAAGTTCACATCACATCCAGTCTCACTCTCTCTCG 

CATGTTGCAGGGGCTTCCCCTTACTCAGATCTAGCAATGGTTTTCATGCGTGAAATACAGCCATGGCCCTGAGGCTTTAG 

GCAACAATCTGAGAGGGGAGCTTAATTGCTAGTAGCAACTAATAACTGCTTCTCTACCCATAGTGTTATTTTTATAATTG 

TCCTCATCATTATTAATAATAGTGGGGATGAGGATGACCAGGAAACCTTACCTAGACAGTTGTTTCGACAAGACATGAAT 

CACAGAAGGCACCTGCACTGTAGTTACTCAGGGCCAGTTGCTCTGTTTTCATTTCAAGGTTGACTATTTTGGAGATTTCT 

TTACACCTTGGTGTATAGATTGCCATCATGGGAACCTGGCCAGGTTTGACATGCGCTTTAATTTGACCTCTTGTTGTTTC 

CTAGAACAAAGGCCGAGTTAGATCAAGAAGCCTTGATCAGTGGCAATCTGGCTACAGAAGCACATTTAATCATCCTGGAT 

ATGCAGGAAAACATTATCCAGGTGAGGAAAACAAACACCCAATCTGATTTGTTGGCCATGAATATGTTTACTAGAATAAG 

GACTTCTTTATGCAAAATTGTGAAAGACATAAATGTGATCCCATAGTACCTTTTTTAAAAAAATGAAGTTGAGAAGTTTA 

CTATTTACAACAGTGTCTACCTTATAAATTCCAGAGATACCAAACATTCTTCTGGCTTCTTTGACTTAGGGCTTACTTGG 

AGAGGGTTAGGTGTTTGGCCAGCTGACCCTCTTGGTTAAATCTGTGTGAGTATGTACCAAGTTTATAATATGGATGTTGG 

GTTTATCGTTTAGTATCTAGAACAGTAGTGGTAAGTAGAATTTTTTCTGATGGGTCAACTCCAGTTGAATGATGGTCACT 

GTCTGATATGGGAGCTATGATTATGACTAGGCTAGGTAAAAAGAGTGCTAAATTTGACAAATGATGTCTTCTTTGGACTT 

AAATTTGTTAAGGAAAGTCATTTGTACCATGAATTTGCCATCCCTGCTGTAGAAAAATATAGCTTTGTGAACTTTGTACC 

ATACTAATTTTATCTTCTATGTGATTATTTCCACAAATTCCCAAGCTGTCTAGGTAATAATGAGTTTTTAATTACCCTGA 

AAAATGAGTTCTTACATGTTTCCATTGAGAAGTCATTCATTAGAGTAGGTCCAGGATTGCTTTTAGGGCTAGAAGAAATA 

TCGTTGAAACACAGTGAAATCTTAATTCTCTAACTTTTGAATTGTCTAAAATCAAAGTAATCATCATACAAAAATAAACA 

CAAAAAGTATGTGATATTTTTGTTGACTTTAATATCTTTGATAACTTAAATGCTTGGTATCACATTTACCTTATCTTTAT 

ATAGCACAATATTAGGTGCCAAATATCTATACTAGCCCCCAAATATATTTGCAGTTTTCAAAGAAAGCTGAAACCTTTTG 

TTATTATCCTTGGTGTTGTTAGTCCTTCTGTAGGTGATAAACAAGCTTCTATTTAGAAACATTGCTGCCACCAAGCAGCC 

CCTGTTGTACTGGGAAGCCCACAATTGTGTTTTGCATCCCATAAGGAAAGCTATGTCTTGTATACAAAGAAAGAACTTTC 

CAAAAGATGTGACCCAGGATGAGGGAGATGGGCCTTATACCTTCATTTAGGAACCCAGAATTAGGTATAAATCCCAAACT 

CATTGGAAGCATTGAAATAAAGCCATTTGGAAATAGGTCTTCAGTTCCCATGGTTAATGGATGATACCCATGGTGGCTCA 

CCAAACTCTTAAGACTCACCACTGGACATGGAACATCAGCATTACTGAGCTAATTGTCAGGAACATCCAGTTCATTGGCA 

CAGTGCAGGGATTCAATGATGCTGTTCTTCCATTCCCCCAGGCGAGCTCGGCTCTGGACTGTAAAGACAGCCTGCTGGGA 

GGTGTTCTGAGGGTGCTGGTGAATTCTCTGAACTGTGATCAGAGTACCACCTACCTGACTCACTGCTTTGCAACACTCCG 
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TGCTCTCATCGCCAAGGTAAACTTGGGATGCTTGTTTTCTTCCTCTTAATTAAGAGTAAGATTCTCATCTAGCTTCATAC 
TTCTCTCTTCAGGTGGACCAAAAGTCACAGAGCATATTAAGTGGCATCACAGTAAAGGTCTTAAGTCTTCCTAGGAAGAA 
AGCAGATGCCCTGATTCTGTGGGAAGCCACCATGGAGAGGAAAAGCAGTGGCTCCCATATTTGAAGTGTGGACCTAACTC 
TAGAAGTTTAAAATGGCCATTCGCTGAAGGTCTATGACATGAGAACAGAGATCAACTGAGTGACTTAGCAATTTCACTCT 
TTCTCTGTAATACCTCTGCTGAGTGAGATTAAATCCTCTATGTGACGCCCATTAGTCTTACAAAATGTCATGCCATAAAA 
TGCCAGGAAGGTCAGAAATGAATTTCTCACGGCCTGAGGAATGAGGATTATCCTGGGGTAACATGCAGATTATTTTTCCC 
TTTATTTATTTATTTATTTATTTTTGAGACTGAGTCTCGCTCTATCGCCCAGGCTGGAGTGCAGTGGTACCATCTCAGCT 
CACTGCAGCCTCTGCGCCCTGGGCTCAAGCGATTCTCATGCCTCAGCCTCCTGAGTATTGGGATTATAGGCGTGTGCCAC 
CGCACCCAGCTAATTTTTGTATTATTAGTAGAGACAGGGTTTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTC 
AAGTGATCCACCTGCCTCAGCCTCCCAAAGCACTGGGATTACAGGCGTGAGCCCCCGTTCCTGGCCTATTTTTCCCTTTA 
TTGAAGATCTCAATTGGTGCCTTCTACATGGGGTCTTTTAAATTTAAAAAGTAAAATTCTTCTGCTCATCCTTCTCAGGA 
CCATTTTCTCTTTCTTCATCACCAGTAATTTCCCAGGAACCCAAGAAACTCAGGTTTCCTTCCATCATAGTTGTGATTTC 
ACCAGTGAATGCGACCTGGCTCACAGTGCAGTTGATAACACAGCTCTGACCCTTTTAGCTGGACAGTTCATTATTAAATC 
TCAAGTCTACTCCATTGCTTAAATCCATCTTCTGATTCACATAGCTCATTATCTTTATGGAATAATGCATTAACTCTTCT 
AGGCTTTTTGCTTGTCCAAATGGACATTTGCATATTTCAACGGTCCAGAAAGTGTATCAAACTGCCAAGTGATGCCTAAT 
GGCCCTTTATGTCTCTCCTAGTTTGGAGACTTACTCTTTGAAGAGGAGGTGGAACAGTGTTTCGACCTATGTCACCAAGT 
CCTGCACCACTGCAGCAGCAGCATGGATGTCACCCGGAGCCAAGCCTGTGCCACCCTTTACCTCCTCATGAGGTTCAGTT 
TTGGAGCCACCAGTGTAAGAGTTCAAACCAGCTGAGTGACCTGGAATCAGTAGAGAAAAATTGATGTAAAGCATCAGCTG 
CGAAAAAAAATAAGGAAATTTTGCAGTATTGCAGTTTACTTCTGTCCTGTGAGAAAGAAACAATTGAGTATGTAGATAGA 
TAGCAGCTTCCATTTTAATTTGCATCTAAAAGTGAATTCATCAGATAAATGCAGTGGTCTCTATCAGTGTGTTTCTAAAA 
TAGACAGCCAGGGGCCAGGAACGATGGCTTTCACCTATAATCCCAGCACTTTGGGAGGCCGAGGTGAGTGGATCATTTGA 
AGTCAGGAGTTCAAGACTAGCCTGGCCAGCATGGTGAAATCCTGTCTCTACTATAAATACAAAAATAGCCAGATGTGTTG 
GCGCATGCCTGTAATCCAAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCTGGGAGGCAGATATTGCAGTGAG 
CCGAGATTGCCCCATTTCACTCCAGCCTGGACAACAGAGTGAGACTTCATTTCAAAAAATAATAATAATAAAATAAAATA 
ACCAGGTGCAGTGGCTCATGTCTGTAATCCTAGCACTTCGGGAGGCCAAGGCAGGCAGATCAGATGAGGCCAGGAGTCCA 
AGACTAGCCTGGCCAACATGGTGAAACCCCCGTCTCTACAATACAAAAAATTAGCCGGGTGTGGTGGCACACACGCCCGT 
AATCTTAGCTACTGGGGAGGCTGAGGCACGAGAATCGCTTGAACCCAGGAGGCAGAGGTTGTAGTGAGCCAAGATTGTGC 
CACTGTATTCCAGCCTGAGACCCTGTCTCAAAAAAAAAAAAAGAAAGAAAGAAATGGAAGAGTATTTTAGATTAAAAGTT 
ATCATCTGTGGGGGAAAAAATACAATAGACAGGTTAGAATTCAGAAGAGTGTTTCCTGTTTCTAAATTCTGACTAGCTAG 
TGCCAGAATGACCTGTGGAAGAGGATTTTAAATGATCGGTGTCATCTAACCTGAGTTTTATTTTAATATTTTATTTATTT 
ATTTATTGAGACAGTGTCTTGCTCTGTCACCCAGGCTGGAGTGTAGTGGCACTATCAGAGCTCACTGCAGCCTTCAACTC 
CTGGGCTCAAATGATCCTCCTACTTCAGCCTCCCATGTAGTAACTGGGATTACAGGCACGAGCCACCTTACCCAGCTAAT 
TTTTTTTGCATTTTTGTTGAGACAGGGTCTTGCTGCATGCCCAGGCTGGTCTAGAACACCTGAGCTCAAGTGATCTTCCC 
TCCTCAGCCCCCCAAAGTACTGAGATTATAGGCATGAGCCATCCTGCCTAGCCAAGACTTGAGTTTTATTCAAAGCTACG 
AAGACTTTGGAGTTCAGCTTTATTATAGAACAGTCAAGTTTGCTTTAGTTTGTCTAGATTTTGATACCTTCTTTGGAATT 
TCCATTTGTGGCCATGTTAATAAGTATGCTCAAGTGATATATAAAGATAAATTGGCCCATGGAAAAAAGTCAGCCTCCTC 
CAAATGTATTAGGGATGATTATTTAAAAGACATTCCTCAGGGGACCTTGAGGTAGCCATGTTTTTCCATGGGCCTGTAAA 
GAAAGAAGAAACAAAACCTTGTTGCTTACCCGGAGTTCAAAATCTCAGAAATGCTGGCCACAGAAGTCCCCTGATTTATT 
TATTTAGAGACAGGGTCTCGCTCTGTCACCGTGGCTGGAGTGCAGTGGCGAGATCTTGGCCCACTGCAACCTCTGCCTAC 
CAGGTTTAAGCAATTCTCCTGCCTCAGCCTCTCGAGTAGCTGGAATTACAGGTGTCCACCACCATGCCCAGCTAATTTTT 
GTATTTTTAGTAGAGGCGGGGTTTTGCTGTGTTGCCCAGGCTGATCTTGAACTCCTGAGCTCAAGTGATCCACCCTCCTT 
GGCCTCCCAAAGTGCTGGGAGGCTGAGCCAGAAGTCCCTTTCTTTTAATAAAGTTTAAATAAAGTCCCAAGAAGAAACTC 
TTGGCACAAAAGGATATACTGTATTCTTGGACCCAACTTTATAAGAATCTTCCAGCTTGCAGCACAAAGGCAGCCCAGTC 
CTCAATGAAAATTTAAAGGGAGCCTGACAGATTTATGTGAGAGCAATGTCCATTTAAACCATTTAAACAACAATATGAAT 
GTTGTGCAAAGTGTAGCTCCCATTTCATTGAGAGAAGAGGAAATAATTAAGACGGGGCAAAGGAAACACTGAGGAGTTGT 
TTGTGTCTGGCCATGCTGCTTTCAGTTATCTACTGCTAAGTGTGTCTTATTTACTTCATCTTTTTTTTTTTTTTCACTGA 
TGCAGAATTTTGCAAGAGTAAAGATGCAAGTAACCATGTCCCTGGCATCTTTGGTGGGAAGAGCACCAGACTTTAATGAA 
GAGCACCTGAGAAGATCCTTGAGGACAATTTTGGCCTATTCAGAAGAGGACACAGCCATGCAGATGACTCCTTTTCCCAC 
CCAGGTACACCGAAGCACATACCTTGTCTCATGCATGAGTTTGGGATCTGCCAACTATTGTGTATGTATGTATGTACATA 
TATACACAATTTATATATAAATATATAACATTTGCAAGTATTTATTGTCCAATATGCATGTGCTCTCAGCACTCTGAGAG 
GTTTTAAAAAGAAATACATCCATCACTGTCCCCAGTCTCAATATGCTAACAGTCTATTTGGAGTGCTCAGTCTCAAAACA 
ATTAGGAGGCAGTACAAGACAAGTGATACATAAGTGCAAACTGTGTGGTAAGGAGCTAAATGCTATGGAAGGTTAGAGGA 
AGAGAGGTCGTGATCAATAAACAATCTGGTGGCCAGGAAAGCCTGCAGGGAGGGATGACACTAAAGGCTGGGGTAGGATT 
CCGCTAGGCAGGGGAAAAGTGAGGGGGAAATTGTGAGTAGTTGAAACAATGAATTGTTTGTATCTGTAGGGATGGTAAAG 
GGACCAGCTTGTCTGGTGAGGGGTAAGGAAGGGCCGAGTTTGAGAGAGCTTGTAAACAAGGTTGTCCTAAGGAAATGTGT 
AAGGACAGTCATCTCTTCCATGTCCATTACGATGACCGTGGAACTCTAGCCTGTCCCATTTGTCTTTTAACTCCAGCTCC 
TAGCCTGCTCTATGGTAGGGTCTT7VATTAAGTGAGAATGAATGAATGAATGTTTTCTTTTATTCCCTTAGTCTGAGAACC 
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CTATTTCCGGAGTCTGTTTTGCCCTCTCGGGCAATGTTGCATCCTCATTCTCAGAATCTTTTTCCTCTCCTCTACATCAA 
ATCGTTCCTCTCTCATTTCATCTTGACCATACTCCTTTAGTTCCATCATTTATATAGAGGTATTCACCAACAAGACCAAT 
CAAACTATGGGCAGTTTAATAAAGGTCTTCAGTGCCTTCACCCAATGAAATGACTCTAGTGGTAGAAATTTTAGGAGCCC 
TGGCAAGCTGGCAGAGGGGAACGGGGATAAGACAACATTCTGTGGCTGAGTTACCTGCCAGGGTCTCTAGATCAAGCCAT 
AGTCTCTCCCTGTTTTTGTACTGCAGGCTCCCCTGGACCTCCACTGTTGGTTTTATAATTAAGAATAAATGATTACAAGA 
GGTTCTAAAATCTCTGAAGCCCTGGGAAGATCCAGGAGGCTTCTGAGACATGGAACTCAAGCTGAGGTCCTAAGCTGCTT 
CCTACTTGGTATAAAAATCCCTGATATTCCAGAGTAGAGTTTAGAACTTTTCAGGTTACAAATAACTGAAACTGGTTCAA 
ACTAATTTAAACAAAAATGTTGGAGATAGAGATAGTATGAGGATTCAGGCAGATTTCTGGATCTCAAGGGCCAACACACA 
TCCAGGTCTCATAAACTCCTGGGCTGGCGAAGATGAAAACTACAGAGTCAGGTCTAGATGACTCCGCCATCCCTAGTCTC 
TGCTGCCCTCTGATCCAACCCTTCCCATGGCTTCCCCTTGCATTTCAATGTAATCCTAACTCTTCACCTGGGACTACAAA 
GCTAAGATTATTTTGAGGCTTACACTTTTTGCAGGTGGAGGAACTTCTCTGTAATCTGAATAGCATCTTATATGACACAG 
TGAAAATGAGGGAATTTCAGGAAGATCCTGAGATGCTTATGGATCTCATGTACAGGTAAGCTTTCCTGACACACTCAAGG 
GACACCATTTGGGGGTCGAGGATTTGTCACTGTGGAGTTCTTACTAATGTAATGATCACAGCTAACATGGATATAGTGAT 
TTGGATGATAGCCAAATAATATATAGAAATTAAACATTCAGAGTAGGTTAATTCATATGTAAGTTTTCAGAAGGATCTCC 
CTAATTTAAAGTGAGGCATAATAATGTTATTAAATATAATAACATTATTAAATATAATAACACTATCTTCTACTTACCAC 
AGAATCAGAGAAAGAATAATTAACTGTTATAGGAATTATTTTCCATATGCTTTTGTTCCAATTATATTCAGACATACATA 
TATAGTTTTACTTTAGAAATCATTTTTTACGCGTAGTTTAAAAGTTGGTCTTCCAGTCCTTGTTCAGGAGAAATTACTTA 
CAGAGGCAAAATTGTTCTGATGCAACATCATACAAAGGGCAGTACTTTTGTCTTCTGTTTTATTTTGAGAGAAAGGAAAG 
AAAAGGCAGAAATTTGCCTGAGAGCCATTAAAATAGACATCATGTTATCAGGTATTTTTTCCCCATAAGGCTTTTACTAA 
GTACTATTTCTTGGAGGTCAGCACAGCTTAAACATGGAATAAAAATAGTTGGACTAATAAATGTTTTCTTCTGTCTCGTT 
TTCTGGAAATATAGGGCAAAATCTCAGGTGGAGGGGTACAGGGAACTCTTGGGGAGAAAAAAAGAAAAGGTCACACAAAG 
TAGAAGAACAGTGTCATTAACCCACTGTCCTCAAAACTACTTCTCACTCAATCTGTCTTCAGAATTGCCAAGAGTTACCA 
GGCATCTCCTGATCTGCGGCTGACCTGGCTCCAGAACATGGCAGAGAAACACACCAAGAAGAAGTGCTACACGGAGGCTG 
CCATGTGCCTGGTGCACGCCGCTGCGTTAGTGGCTGAGTATCTGAGCATGCTGGAGGACCACAGCTACCTGCCCGTGGGC 
AGTGTCAGCTTCCAGGTAGGGTGTGTGCAGCTTTTCCCTTAGAGCAGTGGTTCTCAACTGGGGCGATTTTGTCCCCCAGC 
CCCAGGGACATTTGGCAATGTCTAGATACATTTTTGGTTATCACAACTGGGATGGGTGAGTAGGTGCTACTGGCATCTGA 
CTGGTAGAAGCCAGGATGCTGTGAAACATTCTGCAATAGGAGAGCTCCCCTGACAAAGAATTGTCTGGCCCCAAATGTCT 
GTAGTGCTAAGGTTGAAAAATTCCAAGTTCATACATTACATTTGCTTCTTCTAATTGCTTTCCCATCGTCGTTGGGTTTT 
TTTTAAATTACTGTTTACAATAATGGCACCTAGCCATTATTAATAGCACTTTAGGAGACATTTGCAAACACTTTCACATG 
CATGGCTTCATTTGAACCTCCCCGTAAAGCTGTGAGGCAGGTAGGTAGGGAAGGCGGTTATTATTCCCACTTCGCGGATG 
AGAGAACTGAGAGAGCAAGTTTTCTAAGGTCACTTAAACTCTTTTTCAAAGACTTGTAGTTGACACAGTATACTGACATT 
GTGAAAGTTTGGAAAACATTGGATAAATGATTTTCCTCCTGGCCCATTCATTTGATTCCACTCTTCAACTTTATAGGGGC 
CCACTCTCCAATCCAAAAATCAAGAAAGAATCAAATTGACCTGAGAAGGGAGACAAAAGGCTGAATCAGTACCTTCTTAG 
GATAAGCTGAAAATTACCCACATTTGGCAAAGGGAAATTGTCTGCCAGACCTAAAAGCTGGCTCGGATGGGGATGCTCAG 
TGGCCACTTAAAGTGTTCTTACTGAATAGTTTAATAGTCTTTAGAGAGAGAGAAAATAAAAAAGCACAATGTTGGGTACT 
TTTTTTGTAAGAGACATAGTTTGTAGAGATGACCATCCCTGAAACCATGAACAATATAGCTACAGTAATAGAGTGTTTTT 
CAAGCCAGACTCACGAAGTCATTTACAAGGGTTTGTATTATTCTTGTTTGAATTTACATGGCTGATTTTATGAAAAGCTT 
TGTTCTTGTTATTGTTCTTCAACACAATTTTGTGATGTTGTATGAACCAGAAAGAAAGAACAATTCAAAGTAGCTTCCCC 
AGGCTTAGAGAATAAGTCACTGAAACTATGCTGGTGCAGCCAAGAGCTTCTGGTTTTCCAGAACACAGCAAAGCTGGGTA 
TTGCCTCCTATGAATAACTCCTCCTTTCTTATGGTCCTCAAGAACAAAAATAGTCATTCTGTGATTTCATGCTTGGCAAA 
TGAATTTTCTTCTTAATTCAGAAATGTTTGTTATAAAAGCTGATAATTAAATCTCATCCAAAAGCATAAAAATAACACCT 
GATTTCAAATCACATAGAAGTGTACAAGAAAGTTATAAAAGTTATTTTAGATGTATTTGCTTCTTTTCTCAAATTTATTT 
TGACCATGTGATACTATCAAAAATAAATATTATAGTACTACTCTTTCAATTCCCAGGAAATTGTAAGGTTTAGCACTTCA 
TATTGTTTCATTTACTAAATTATTTTATACTTCTTTTATTCCTTTTCCCATGACTATATTTTATTTTATATTTATCACTT 
AAATATCAATTGCATTTCATTATTGACTTTTATATTTAGTAAGCCTTACTGTTCTAATTTTACCTAGAATTCAGTTGATT 
TGCTAATAATGACATGCCAAAGTGAATCATTATTACACAATCAACAGAAATATTCCACATTATTCCGACATGGGGGCATA 
CAGCTCTATCTGTTCACATATATTTATCCATTGATTCCTTCTTTTAGAGAATATTTATTGAATACTTATTCTGTCCTCAT 
GAACGTTACATTCTAACCAGAGAGACGTAATATAACTAATTATTCCAATCTTTGTTCAGTTATAATTATGAGAAATACTG 
TTATAAAGAGGCACAAGATATTGTGAGCATTTATGGTCCAGGCCCTGTGGGTCAGGGAAGGGTGAAGAAGGTGAAAAGGA 
AGGCAGAAGAAACTGAAGTGTGAGGGCTTCTTGATGTAGAGGAGGCAATGAGTTAGGTGTTGTCAGCTACAGAAGAGAAG 
CCAAATTATATTAATGTGTATGAGTGAATTCTTACTTCTCTCAAATGGGACATACCAAATCAATTTGGAAATGTAGCTGG 
CAAGTGAAAGGACACCCAACCCAGACTGACAGAAGGTAAAAATGGAATTTATCAGCCCGTCTGAGGAATGTGAGCCTGGA 
GTAGCACCTGCCATACTGGACCTAGGGGCCCAAACAATGTCAGCAGAGCTCACTGTCATCACCACTTCCCATCTCTGCCA 
TCTGTTGTGTCAGCTTCATTCTTCACAGGCTTCCCACACCACAGTTTAGGTTTACATCTCTAGGTTCAAGTCTAGGAGAG 
AAGAGAGATGCCTCCTTTCTAGCAGTTGCAGAGAAAGCCTCAGTGCATCTTACTGATCCTGTCTGAACATCTGATCCTGT 
GCTCATCCCTGAACCAGCCATACTGGCCAGAGAAATGGAGTGCTCTGATTTACTGACCCCGGGTCATGGGTCCACACCTG 
GAATTATGGGTGATTTCTTCCCCTAAATCAGGATGCCTTACTTTATAGAAGGAGGAAAGAATTCTAGACAGCATTAACAA 
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CAGATGTCCAGTGTTGATTTTGCAAATGCTCTGTGCTTTAATTTTCAACCTTGTTCTGCTCCAATGAAATAGAGCTTTTG 
GAAAAGATTTATAAACTAGAGATAAATAAATATGTGCAAGGAAAATAACTTTGAGGTCACTGAATTCCAGGAAACTGAGA 
TCACT6AAATTCTGTGTCCCAGAGTGCAATATTTATTTCACAACTGTAGATACGGACACATTCTTAGATACTGCTGTTAC 
TTGTACCTCCCTGATCCTGAAGCCAAGAAAGTCTGCAGAATCCTTTTCCTCTGACTACAGCTAATGAGGCTACATAGCTT 
CAAATCTGTTCCTCTAATGTGGAAAATTGCATACATTCTAATAGTATTAGTATCTTGGGTTATTAAATGCTrTCAACTGA 
ATTTCTTGGATCTTCTGTTGTCACAGAAACATCATAATACATAGGGCAGGTTTGGAAGAAAGACTGGCCACAAAGGCTTT 
GAGAGCCTCCTCCTATATTCCTAAAACTACGTTACAGTATTGCATGTGAAGAGATAGGGCTATCTATGACAACTATGTCC 
TGACTGATTGCTAAGGTTGATTCACATGATCTTGCTAACCAGGCCAGAAGGCAGACAGCTTTTAGTTCACAAGCCAACTC 
TGATCAGTTAGTAGTGGCTGACTGGAGAACTATGCTTAAGAATTTCGAGACTATGTCCAAGCTCTGGGGAAAAAGTGCTA 
CAGTTGATTAGTTATGCCTGCCATGATTACAGCAATAGGAAGGAGTGGCATGTGTGCCACCTGTTTGTAATCCCTAAACT 
GGGAAGGTTTCCCATTTCTTCTGTTTTTCATATGCATTTCTTCCATAGCTGTGAGCTAGGAAGAAAATGATTCTTGACCT 
GTCACATATTCACTGCCAGGGCCAGTGCTAGGGTGAAGAGGCACTCACCCTCAGGGTCGAGCGGGTACAAGATCAGTACT 
TCCATGGCCCTAAAAGCGAGTACCTCTCTAAATTTTGTCTTGGGTTTCTCATTTGGTTCACCCCAACCATGGTCTCTGCA 
TGCTCTGCTAGAGGCTCTAAACGCAATAGTTTATGTAAAGGAAACAAATGCATGGAAACAAAATGTTCAGGAAGAACAAA 
AACACACACACAGTAACTGCTGCAATGCCATGAAAACTTCCTTAATGAAGACAGCCTCGCTTGCTGTTGTCGTATGTCAT 
GGCTGTTTATCTGAGTCAACTCCAGAGTAGCAACATACTTCAGAAAAACACCACTGTAAGTCAGAGGTCCACTCGGTGAA 
ACAGGGAGCCTAGTTAATGTTAATTGGGTCTTTGCCTTTTGAAAACCAGGACACCAGCCCTATGTCCCTTAGGGTTGTTT 
CACTAAAGTAACTCAGCTGTTGTGACATTGAGGTAAGTGTCCTTTATACAAAATCTCCTAATGGTTAAAAAGAAAAACGT 
GAGGTTTGAAGACCAGTTGCTCAGTGCGCCTCTTCTAAATGAATGGCAGACAGATACTCTCGGGGTAGAATTACAGACCT 
AGTTTAGTCACGGTCTTGGTAAGGATCTGCACACCAGCTTCCTCGTTTCCCCATTCGGGGTTCCTGTGGTCTCTTACTAG 
TCTGGTCGCCCTGTTCTCCAGGCTTATACTGTGGTCTCTTTCAGAATATTTCTTCCAATGTGCTGGAGGAGTCTGTGGTC 
TCTGAGGACACCCTGTCACCTGACGAGGATGGGGTGTGCGCAGGCCAGTACTTCACCGAGAGTGGCCTGGTAGGCCTCCT 
GGAGCAGGCCGCGGAGCTCTTCAGCACGGTCAGTGCCCAGAGGGCATCCCGGGGCCTGGCCTCCCATACTCCAGCTGGAC 
TTGGGGTGCTGGGAACACCTGGTCTTAATGGCCCAGTCAGCCCCACTTCCCGAGGACACGTGCCAGGGTGTGCGGGGCAG 
GGGATGGGCCCGGGGAGGACTTTGATGTATGCAAATTGCATGAGCTTCCCAAGGGAGCTGAGATAACCTTTCATCACAGT 
GCCGATCTGAGCTTCACTGTATGCTCATTGGTTGGGCAGCAGTTTCACAGTATTATTTCTATTTAATAGGGGTGGAACTA 
AGCCACAGAGAGGTGAAATGGCCTGCCCAGGGTTACACAATAAATGATGAGGCATGTTTTCACTCCCTCGTTTTTCCTCT 
CAGAGAGAAAAAAATTAGGGAGGAACCACTGGGAGGAGAGAGGAGGAATACACAGACAGTGTCTTCCCTCCTAGCCACTG 
TGCAGTCTGAAGGACCATCACAGACCAGGACCAGCTTACAGAAATGTGGGCACAGAAACCACTGAGACTCCTCTGGTTAA 
CGTAATCTGGATCTAAACACTCCTAGTATATATACTAGAAAAATATATAGAGAGATGAAGTCATTGAGATTCAGGGCAAA 
GAGGAAACACTCTTGTCTATTTTCTTTTCTTTTTTTGAGACAGAGTCTAGCTCTGTCGCCCAGGCTGGAATGCAGTGGTG 
CAATCATGGCTCACTGCAGCCTCTACCTCCAGGGTTCAAGCAACTCTCCTGCCTCAGCCTCATGAGTAGCTGGGATTACA 
GGTGTGCACCACCACACTCGGCCAATTTTTTGTATTTTTAGAGGAGACAGGATTTCACCCTGTTGGCCAGGCTGGTCTCA 
AACTCCTGGCCTCAAGTGATCTGCCCATCTTGGCCTCCCAAAGTGCTGGAACTACAGGTGTGAGCCACCACGCCCAACTT 
GTTTTCATTTTAATAATCTCCCTCCTCCTTTACATTTTAAGCCAAGAAAGTATTCAGTACTTTACTATATTTAGCTGACC 
CAATTTTGTTTTCATCTATACTATACTCATCCTTATTTTCCAGTTTTTATTTCCAAGTTTCCTCTCTACAATTTTATTTA 
TTTTAATTATTGTTTTAACTGTCTTCTCTCTATTCTTGCTTTCCTTTTCAAGATTCAGAAAATGTCTAATATACTCTCAT 
TTTTCCTCAAACTCAACAAAATGAATTAGAATCCTACTAACTCTTTGGAGGCATACATTTAGCATCTGGCTAGAGGAGGA 
CCTCTGATGAAATTTAAATATACTAAAACTGCCTTTCTGAATTGCTGTTAGTCCCTGCTACCAAACTTCTCTCCTGTTTT 
TTCTTTTCGTTTTGTTTTGTTTTTGTTTTTGTTTTTGAGGCAGCGTCTTGCCTTGTCACCCAGGCTGGAGTGCAGTGATG 
CAGCCTTGGCTCACTACAGCCTTGACCTCCTGGGCTCAGCCTCCCACCTCAACCGCCCAAGTAGCTGAGGCTACAGGAGC 
ATGCTACCACACCTGGCTGATTTTTTAATTTTTTTGCAGAGATGGGGTCTCCCTATGGTGTCTAGGATGATCTGAACTTC 
TGGGTTCAAGTGATCCTCCTGCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACTGCACTCAGCCAGTATTT 
TTTTTTCCCCCGAAAGCTCTTCTCCTTACTTATCGCCATACAGGACTACTTAGCGAGGTGTCTAGTTCAGTTTGAAGGCT 
ACCACTGTCCCAAAAGTGCTCAGATACCCCTTCTTGCCCTGTGAAATACTGTGATACAACAATAAATTCACTCTCCAGCA 
CATTGTTTGGACAATGACCTCTGGTTGCTCTTCTTAAGTTTCCAGTGGATTAAATTCTCTCTGATGCTCTTCTCCTCTTT 
CCAAGGGAGGCTTATATGAGACAGTTAATGAGGTCTACAAGCTGGTCATCCCCATCCTAGAAGCGCATCGAGAATTCCGG 
AAGCTGACACTCACTCACAGCAAGCTGCAGAGAGCCTTCGACAGCATCGTTAACAAGGTAGCCGGGGAGCCTGGCTGGCA 
GGTCTTGTTACCTGGTGGCAGGCGACCCTGTCCTACAGATGCTTAGCCATCCTTCCTCTCCAGGGAGTGATTTATCTTTA 
GCACATTGCTTTTGCTCTCACCTGTCAAACAGAAAAGGGCTGAAATTCTTCTAACAGAGGACCAAAATTCCATATGTGAA 
AACATACAGCTTAAATTACTTTATAACCAGGAAATGTGAGAAATTTTTAAGTGTAATTAAAAGAAGTCCCAGAAATCTTT 
CATGGGATTCCTTTTGTTGTTATTTCTGAAGTTTATTCCATAAGCATTAAATTTTTTTAAGGAGTAATTTCTGTTTACAT 
CAGCCATAGGAGTAAAATGCTTTGTTAACACAATGAGAGACCCCTGCCCTTTGCAACTCAGTGGCTCCTCAGGATGACAT 
AACTAAGGAGAGCTTTTTATATTTTGTTCCTCAGGATCATAAGAGAATGTTTGGAACCTACTTCCGAGTTGGTTTCTTTG 
GATCCAAATTTGGGGATTTGGATGAACAGGAGTTTGTCTACAAAGAGCCTGCAATTACCAAGCTTCCTGAGATCTCACAT 
AGACTAGAGGTAAGAAAAGTGATTCTGTGCGCCTGACCTGGTACACTTTACAAAAACAAGTTAGAGTGGGTCATCACCAA 
AAATAAACAAATAAAGAGTTATGGATTCATCATTGATCTCTACATAAAGTTTTCCCCTTTGCATTTATAAAAGGCAAAGT 
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AGAAAGGTATTAGGTGAGATCATGAGGTGATGTAATATTTTGATAGTTTTTCCCTAATACTCTGTGTATGCTTTTCACAG 
TTTGGAATTTTATATGGTGAATATTTATTTTGAAGTCTGTGCAAAATTCAATCAAGGTCATGTGCTTTCTTATCACCCTT 
TCCAAATATTAGTAGTTTATACTAGTAGATAGAGAGTAGTAGAGTTTTTCAACATGAAGTTTAGCATCTTGACTTTGAAG 
TAATCTAGCCAAATGACTGAATCACCCCTAGATAATGGTGAGGCCATCCTTTAGGTATCACTGGATGGCACCTGGGGTCC 
TTCTGATGTGGAGCTACTCAGCTGTGGCCAGCCCGGCTCCAACCTGCCAGTGAGTGACATTTGGTCCATAATACAACAGA 
AAAGTGTAGCATGTTGTTAGAAGCAAAGCTGAAAGCATGGAGAAAAAGAGAAACAGCCTAGAGAAAGGTCAGGACAAAAG 
AAACACAGGTTATAAGGGCCAAACACTAAGAACTCAGTCAAAGGTGCCTCCCCGATGGGATTTGGATGAACAGGAATTTG 
TTTACAAAGAGCCTGCAATTACCAAGCAGGACTGGGGTCCTTTGAACTGCTCACAATTAACCAAAGATTCACTCCTTAGG 
CTTTCATTGCTTTGAATTTGTGCCCTCCTATGAGTGTAAAACTTAAAACATTTGTGTTGCCTAATGGCTTCGTACACTCT 
CCATCCATGCCAAAAAAGACAAGAGGGTTTATCCTTACTTGACTCAAAGAAGCCATTGCCCAGGGTGGCTGGGTGAGTTG 
CCCCAAAATTGTTTAGCACATGAGAAAAAGAGCTAGCAATTGACCCCAGTCCACTTGGTTCCCTATATTGTTTCCCCTAC 
TTAGAATTAGAGGTATATATTTTATCTTCTTGTGCACTGTGATTTGCCACAATATGGGAAGGCTGGTGACTTCCAAGTTC 
CCAGGGTACAGAAGGCGAGAAGTAAAGAGTGTGATACCCCAGGAGATACCTCAGCAAATATAATGATGTTAGCTGAATTA 
GAGGCCAAGCATACATCTTATAGGGAAGCATATACCAGTTGACAGTGCTATTTTTATTTTTGTCTTAGGAAATGCTGAAC 
TTTGCTCATTAGCTCAGAGGAACTCTCAATAATCAGTGAACATCATTCTACCTTGCACTTGCTCCAAACTTATTTCACTT 
CCAAGAAGACAAAGAGTTGCATTATGTTAAAATAACCTTTATAAACTGTTGGTTCTTCTTACCTAGGCATTTTATGGTCA 
ATGTTTTGGTGCAGAATTTGTGGAAGTGATTAAAGACTCCACTCCTGTGGACAAAACCAAGTTGGATCCTAACAAGGTAT 
ACAAAAATTTACAAAAACTAACCATCAAGCTCTAAATCCCTTCGTTCTCTACCCAAGAATACCTAATGATCTCATCTATC 
TGGACTCTCCAAGTCACTTAAATGCAGTCAAACCTTTTCGTCTAGAGTTCAACTACTAATTGGTCAGATCTTAAAGAAAA 
TATAGTCAAAGGCAGGAATCATAATAGGAGCTACCACTTATTAAGCACCAACTGTGTACCTGGAACTGCATTAGGCCCTC 
TACATACATCATTTTATTTCATCCTGCAACGACCCCTGGAAGTAGATTTTGTCATTCCTATTTTAGAGATGAGAAAACTG 
AGACATGGAAGAGTCAAGCAAGTTTCAAGGTCATGCAAGCAGCAGAGCCAGTACTCAGACTTGAGGTCTGTTGCTTCTGA 
ACCCCTACTCTTCAGCACTGCTCTTTACTGCCTTTTTATAAAACCTTTAAACTCTCCATTTCAAACTCGACAACACTTAG 
TGGCTTCCTTCTTTAGGCTGCAAGTATTCCTTCCATCCAAGTCCGGGATCACTGTGCTGTTGGGGGAATGGTAAAAACGG 
CTTGGGTTTGGGTTTCCTCACTTTCACAAGAGGGTATGTTCCATTTCACCCCAAAATGGGTGTACAGTTCTGATGCTAAC 
ACGTGGAGGTGGTATCAGCTCCCACAGGGTAAAGGCTCAGTCCTCCTCAAGACTGCCCTGACTTCAGATGCCAGCTTCAA 
GAGGGACCCCCAGGCCAGCCACGCTTCTGATCAGCCAGCTACAAATTTGGGAGTTTCTATAACCTGTTAGCTTGAAAATA 
GGAGAAAACAAAGCAAATAATAATAATAATAATAATAATAATAATTTGGGAGTTTCTATGATACCTGTTAGATTAAATAA 
TTCACTACAATGACTCACAGAATTCCAAAAAGTACTCTGGTTACTATCAGAGTATTATGATAAAGGGTGAACATCTTTTT 
GTGCCTTTATTTATGGCTTCCTGGCTCATCAGTGTGTTCTCCAACCAGGAAGCTCCGCCAAGTCTCAGTGTCCAGAGATT 
TTGTTGGGGTTTCATTATGTAGGCAAATTGAATACAGTCTCCAGCCCCTCTCCTGTCTCCAGAGGTCATCCAGTTCCTAC 
CCTGTAATCACAGAATTGGTCTTTTTGGTGGCCACTTGGCATCCTGAAGCTATCCAGGGGCCCAGCATGAGTCACCTCAT 
TAGCATCACAAAGACACCCATCACTGAGGAAATTCTAAGTGTTTCTAAAGCTCTGTGCTAGGAAATGGAGACAAAGACCA 
GACATATTCTTTATTATACCACAAGCTGATGCTGCCCACTCAGCCCAACTCACATGTCCATGAATGAGCTTTCTAAGTTA 
CTGGAAATAGTGCAAGTGCAAGGTATCATTAAGGGCCCTGGGACAGAGGACCATTCACCATCTAGCAAACCTATAAAATG 
AAAGGTCCAAACTCCCAGTTCCATCTTCCAGGATATGAAGAAGATAATGGAAGAGGGGAAGAATTGGCCAAATTGAAGAG 
TTTGTTTTTCTACATTTTTCAGAATGCTTTCTCACTTAAGACACATTCCCTAGCCTCGGCTTGAAAGCAGTGGCTGTGGT 
AAGAGTTTAACTAATCTTCAGACACACATGTCTGGGAGATGGAGTTGGCCGTGTGCCCACAGTGATCTGTACATAGCACA 
AGCTGTGTAAAATGTGACCTCCCTCAGCCAAGGTGCCCTTTTCCCCTCTTTTAAAATTCCCAGGGTAGGTTGTGAGAGTT 
TGGAATGAGAGTCTGAACCCAGAGTTACAACCAGATTTCATTATACTAAGTCGTGATTTACAGTCTGAGGTCAGTGACCC 
CACTCATCCCTTTCAGTGGGTGAGTGTCCCAGCATCTGAACTCATGGTCACTTTTTTTCCTAAGAGATTGTCGTCTTTAA 
TGAGTACTTATTCGCTCTGTGTTTGAGGGTGGAGGTGATGTGAGTGTGTGTTTAAATGAGGCCTAGGCAGTAAAATTCAG 
TTTTGGTGTTTAGTTCTATGAGTTTTGACAAACACATGTGACTACCTTCATAACCAAGATATAGAACAGATCCACCACTC 
CAAAATACTTCCCGTGCCCATTGGTAGTCCATGTTGCAATAGTTCCTTCCTTTCCGTTGCTGAGTAGTATTCTGTTGTGT 
GACTACCTCACCATTTGCTTGTTAATTCCCCACTGGAGTGACATTTGGGTTGTTTCTAGTCTTTGTTATGAATAAAGTTG 
CTGGAGACATTTGTGTACAGGTTTTTGTATGGACATAAGCTTTCATTTCTCTCCTGTACACACATAGGAGTGGGGTTGCT 
GGGTCCAATGGTAGTGCAGTTTAACTGCATAAGAGACCGCCAGCTTCTTCTGCAAAGTAGCTGTGCATTTTGCATTCCCA 
CCCTCTGTGTATGACAGCTCTAGCTGATCTCCATCCTTGCCAGCACTTGATATTGTTAGTTTTCTTTAGTTCGGCCATTC 
ATGCTTCCTCATAACTGTGAGGTAGTCATTGTCATTCTGTAAAGGAAAGTTTGTCTCTTTGGAATAGCAGTTTGCAAACT 
CTCATACCAAGGCTTCCACTGTGGATCAGTTTGCTCTGCTAGGAAGGTGCGGCCGGTGGCCCTCACTCTCCCCACAGGCA 
CATTCCCCTCTCCCTCCTTGCAGCCCCTTCTGTCCTTGGGGGTGGAGAGGAGGAACTTCTGCTGGCCACATTCTCCCCTG 
CATCTGTAGCTTTTCTGCACTGGGCAGGGCGGTGCCGGCACGCCGTGTTCCTGCATGCCCCTCCATTGCGTCAGGGATGG 
CCGTTTGCAGAATAGCTCATCTTCTCCCTCCGTGCCTTTTCCCCCTTAGGCCTACATACAGATCACTTTTGTGGAGCCCT 
ACTTTGATGAGTATGAGATGAAAGACAGGGTCACATACTTTGAGAAGAATTTCAACCTCCGGAGGTTCATGTACACCACC 
CCGTTCACCCTGGAGGGGCGGCCTCGGGGAGAGCTGCATGAGCAGTACAGAAGGAACACAGTCCTGACCACTATGCACGC 
CTTCCCCTACATCAAGACCAGGATCAGCGTCATCCAGAAGGAGGAGGTAATGCACCCAAGGGATTGGCCACCACTGGArG 
AGTGGGCTGGGTGGCCTCCCAGGAGGACCCACAAACCTCTTTCACAGTGGATTGGACTGAAAACAAGGAGGGATGCACTT 
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GAAATATGATTATATGGTTGTCCTTTCACTCTCATAGTGCCAGAAAATCCCATTTAGGCAGCTACATATTTTATAGCACT 
ATTGTATATTAAATATTTAATATTTTAAAGTTATATCTTTAGTATAAGTGTGTTTATGTATTTAATTATAATATTTAATA 
TATATTTCAATTATAAACTTGTACAACAGATATATTTACCTTTTAAATATTTTATATAAAATTTTTCTATATTTCAAAGC 
TTAGAGGTGATTCAAGCATAGTCGTGCTGTTAATTATTGGAGACGGGACCTGCACGTGGGCAGCCCCAGTGAGGCGGTGG 
TGTGCTGTGGGAAAGGGCCGTGTAAGGTAGACAAGTTCACTGACTAGCTTCTAGTCCTAGCTTCCTCCTGTGATTTTAAA 
CAAGCTACGTACCTTCAGTTTCTTCATCTATGCATTAGCAGGAAAGACCTCTAAGTACAGTACAAGATTATACTCATTTT 
ATTAATAAAAGGACTGGTCAAGAAGGTCTCTCAAACCACAGAATTGTTCAAAATTCTACACACCATAAACAACTTATTAT 
TCTTTAAAACACATATATATACACTTATTTGTCGTCCTTTTTATGCAGGGCCACAGACTTCTCTGTGACTGTGGGTTTGC 
TGATCACAATTCAGCATTTTCTTTTATAAACCACACCCGTAGTGCTTTGTCCATGATTTTCAGTTTTGCTTTGTGTAAGC 
AGAGTGAGAGCTTAAAGATCCTTGTTAAACAATTTGAGAGCAGAAGCCTTCTGGATGTTTATGATGTTTTTCTCCCCGAG 
ACTTTGACAGCAGTCTTGTGCACACCTAATATGACAGGAATTTTTATAGCAACTCACTTTCATAATATCTTGTCCAACCA 
TTTGGCTTGGTTTTCATAGAAAGAAATCTTTTTCTTTCCACACCCATGGTTCATCAGTTTCTCCATTATCTAATTAGATT 
GGGTCATTAAAATAACAAGTATAACAGGCATAATCAAGTTGGTGAACAAACACAGATGAATTGTGGTGAATATATACCTC 
ATCAGGCAGAAGCAGAAATAGCTGAGCTAACTGGAGAGTCATCCAGCAGGTAGTGTTCAGCTGGGTAACCACATCAGGGT 
GTGGCTTTCACAGAAGCTGAAGAAAGCCTGAAGCAGTGACTCAGTCGGGAGGAGGTGGGTTTAGAAGCCATCTGCTGTAC 
TTCCTATACTTCTTGGTTTGGACTTTTGAACACTGAGACATTCTAGGCATAACACAGATATAAACTCATGGTGCCCAAAA 
GCATTTCAAAACCTTTTCTTTTCTTCATTCCTGCCTTCCTTTCCTTTCCTTCCTTTCCTTTCGTTCTTCCTTTTTTTCTG 
TGTCTCTGTCTCTCTTTCTCTCTTTTTCATTTTTGAGACAGGATCTTCCTCTGTTGCCCTAGAGTGCAGTGGCATGAACA 
TGGCTCACTGCAGCCTCGACCTCCTGGGCTCGAGCAATTTTCCTACCTCAGCCTCCTGTGTAGCTGGGACTACAGGCTTG 
TGCCACCATGCCCAGCTAATTAAAAATACAGAGAGATATATATTTTGTAAGAGACAGGGGTCTTACTTTGTTGCCCAGTC 
TGGTCTCAAACTGTTGGCCTCAAACGATCCTCCTACCTCAGCCTCCCAAGGTGCTGGGATTACAAGTGTGAGCCACTGCA 
CCAGCCTGAAGTTTTCTTGGCCTTCCTTCGTTCCCCAACAACCTCAACAACAGTTCTCAGCTGTTTGCTAGGGCTGTAGA 
ACAAAGTTCCATAGACTGGGTGGCTTAAACAACTGAAATTTACTGTCTGCACAATTCTAGAGGCTAGAAGTTTGAAATCA 
AGGTGCCAGCACTGTTGGTGCCTTCTGAGGGCTGTGAGGAAAGGGTCTGTTCCAGGCCTCTCTCCCTGACCTGTGGACAG 
CCCTCTTCTCCCTGTGTCTCTTCACATCATCTTCCCTCTGTGTTCGTGTCCAGATTTCCrCTTCTCGTAAGGACACCAGT 
CATATTGGACTAGGGCTTACCCTGCTGGCCTCATTTTAACTTGATTGCCTCTGTAAAGACCCTATCTCCAAAGAAGGTCA 
CATTCTGAGGTACTGAAGGTTAGGACTTCGACATATACATTTTGGGGGAACACAATTCAACCTATAAAATTCAGAAAAGA 
CTCTACCCCAAACCAGCAGAACTTAGCAAATAGATTGATTGACCCTTAAAAGAATTCCATTTACTGGAAATTCACCCTCA 
GTTGGAGAAGGCACAGGTGATATCAAAAGCCTGTGTTATGATGGGGGAGAAAATCTTGAGTGCTGTGCTTCTACTACAGC 
TTTCTGCATTGTAAGTTGAGTAACATGAGGCTGTGTGCGGTGGCTCTTGCCTGTAATCCCAGCACTTTGGGAGGCCGAGG 
CAGGGGGATTGCCTAAGGTTGGGAGTTTGAGACCAGCCTGGCCAACATAGAGAAACCGCATCTCTACTAAAAATACAAAA 
TTATCTGGGTGTGGTGGTGCATGCCTGTAATCCCAGCTACTTGGGAGGCTGAGGCAGGAGAATTGCTTGAACCCGGGAGG 
CGGAGATTGCAGTGAGCCAAGATCATGCCATTGCACTCCAGCCTAGACAACAAGAGCAAAACTCCATCTCAAATAAATAA 
ATAAATAAATAAGTAAGTTGAGTAACTTGCTCAGTAATAGGAAAAGGCACCTGACAGGGTAGAAAAAACATGGACTTTGA 
AGATAGACATACTTGTGATCGAATTATGACCTCCTCCCTTACTACCTGTGGCATTCTGTGGAACTTAACCTTACAAATCC 
CCCTTTGACTGTAAAATGGAGAGTATAAAGAGGTTGTTGAGAGGAATAAATGCTAAATGTATGTAAAGTTCCTGGAACAT 
AAGGAATCAAGAAATGTTAGGCCCATCTTTCTTTTTAACCTGTTAAGAGTATTTTAAATTACTCTGAAAGTCTTTCCCTA 
GCTGCCTCTTCAAATTCAGGTGGCCTCTCTGTCATAGCTCCAGGCTTGTCCATAGCTTATGAGACCAGACAGTGACTTCC 
CTATGTTTACGTCTCATGTTCAGTTTGTTTTGACACCGATTGAAGTTGCCATTGAAGACATGAAGAAGAAGACCCTGCAG 
TTAGCAGTTGCCATTAACCAGGAGCCGCCTGATGCAAAGATGCTTCAGATGGTGCTGCAAGGCTCTGTGGGAGCTACTGT 
AAATCAGGTAAGCAAAACCAGAGGTGGCAGCTCCTCTGGTTCTTATTATTTAGGTTGTCATTATACGTCTGCACCCTTCT 
TCCTTGGGGTTGATGAGGACTTTGATCCATAGACAAACACAGAAATGTTCCTACACTTAACCTGAACACCTGTAAGGTTT 
AGAAGACTTTrAGGAAACCTCTTCCCTTTCATGTAACAACCCCAGGTAAAAAAGAAATCCTAGAGATGAGTGGACCAGGC 
TTTAAGGAGTACCACTTTCTCAGAGGAGTCTCCACTTCGGGGCCAGACCTGACAATATGATGCAAATCTGGAGTCATGTT 
GAAGAATGCTTAGTCATGACCAATTCACGCAGAGTAATTGCAGGGCTTGAGACTCACCTACAAATGCCTATAGGAGAGAA 
GGAAAAGGATCTAGAACATCCAACTCTTGGCTCAGTCAGCAGATGAACCCAGCATGCCAAGGACCTTGACAACCAGGAAT 
GACCTGGGACCTGACTTCTTAGGCTACTTCAGACAAGACTAGATCTTCCTATCAGACTTCTTAGAACGCTGACTCCCAAG 
TTCAGCATGGTGCTCAAGCAGTCTCAGATGAAGGGAGGTACCAGCCTAATACCTCTGTCCAGTGGGCCTCCTTAATTCAA 
ATCTAGATCTGTTCTTGTCCACACTTCCACGGCAGTTATTCAGTGAGCATCATGAGTCTTTTCTCATTCAGCGTAATTGG 
ATTTCCCCACAAAAGTTCTGAGTGTACTTGACATCAAGGGAGCAGAAACAGAGAAGAGAAATGCCTATTACATTCCCAAG 
ATCAGGAAAAAAAAATGAGGAAACGTTTGCCTTTGTAAGTGCCAATCCTTTGATAAAATGGAAGACTTTCCAAGCCCACA 
ACCATGGTCTATCTGTACACGATGGATATCTCTGACTCAATCCAGCAGTTATGCAAAATGATGTTTGCCATGAAGGCAAC 
TAGATAAGTGAACAAAAATGAATAGATCAAGGCAATCCTGATTTTTAGAAAAGCAACTCAAAGCACATACCATTTGATGT 
AGTTGTATTACACTTATATATGAATGAAAATCTATTATTAGTAGTATTTGATTGATAAAATAATATTCGGCGCCAGGCTC 
AGTGGCTCATGCCTATAATCCCAGCACTGTGGGAGGCCGAGGTGGGTGAATCACTTGAGGTCAGGAGTTTCAGACCAGCC 
TGGCCAACATGGTGAAACCCCATCTCTGCTAAAAATACAAAAATTAGCCAGTGTGGTGGCAGGCGCTTGTAGTCCCAGTA 
CTTGGGAGGCTGAGGCAGGAGAATCCCTTGAACCCGGGAGGTGGAGGTTGTAGTGAGCTGAGATTATGCCACTGCGCTCC 
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AGCCTGAGTGACAGAGTAAGACTCTGTCTCAAAAACAACAATAAAAAAGAAAATAAATTTGGGCTGTGTATGGTGCGCAC 
GCCTGTAATCCCAGCAATTTGGGAGGCTGAGGTGGGTGGATCCCTTGAGGCCAGGAGTTCAAGACCAGCCTGGGCAATGT 
GGCAAAATCCCATCACTACAAAAAAATACAAAAATAGCCAGGTATAGTGGCACACACCTGTAGTCTCAGCTACTTGGGAG 
GCTGACATGGGAGGATCACTTGAGCCTGGGAAGTGGAGGCTGCAGTGAGCAGAGATCATGCCACTGCTCTCTAGCCTGGG 
TGATAGAGCGAGACCCTGTCTAAAAAACAAAAACAAAACACACACACACAGGGAAAATACATTTATACTTCTGAGTGTTC 
CTCTAAAACTTCAGATTCCTCCTAAGAAATCAAAATTTCCTGAGATGTCCAAGACCTTCCTTATTTTTTTTTGTTTATTT 
CAAACCAACTACAAAATTAAATGTTCAAACTAAGAAGTCACTCCAATATTTTCTCCCAAGCAATTTAATTTCCTTCATAT 
ATGTATGAATATATTCATATATATGTGTGTGTATATATATATACATATATATGCATATACATATACATATGCATATACAT 
ATATATGTATGTGTATATATATATGTGTGTGTGTGTATATATATATATATATATATATTCTTTTTTTTTTTTTTTTTTTT 
TTTCCCACCAGGGACCACTGGAAGTAGCCCAAGTGTTTTTGGCTGAAATTCCTGCTGATCCAAAACTCTATCGACATCAC 
AACAAGTTGAGGTTATGCTTTAAGGAATTCATCATGAGGTAAGAAGGAAAATGGCTGGGAATTTCAGTAGAGCAGTGGTT 
CTCAAAGTGCAATCTTAGACCAGCAGCTTCAGCATCACCTGAGAACTTGCTAGAAAGACAAAGTCTCAGGCACCCTCCAG 
ACCTGCTGAATTGGAATCTCTGGAGGTGTGGCTCAGCAATGTGCAGTTTAACCCGCCCTCCAGGTGATCCTGATGCACAC 
TGAAGTTTGAGGACCACTGCAGCAGAGCAACACTACTTGAACTACTAATGAGTAACTAACACGTCAACTATGAAACGCTT 
TTGTGGCTAGCATCCCGTGTGCCTCACAATCACTTGTTGTAAAACAAGTATCATCATCTTCCTCATTTTACAAAAGAGGA 
ATCAGAGGTTCAGAGAGAGGAGATAATTTTACTTAAGGTCACACAGACAGTTGGCAGCAGAGATGAGCTCAAACCAGGTC 
TTCTGAATCCAAATAGTCCACATGTCCATCAATGTGCTGATATTCCACTGATGCACTAGAGTCCCAGAGGTTCTTTGCAT 
TGGCAGTCATTGTAATGTTACAAATCTTATAATATCTTATTTTTAGAAACTTAAAAACATACACTGCCTCAAATTGAGAG 
AGGCAAGTTCTAAAACCAGGCTTTTCAAAGGTATTATCCTGTGACAGTTCCCCATGTCAGACAAGTCTCATCTCTGTTCC 
TTAATTCTGCAGTGGAGCTGGCTTCAAGAGCAGTCAGTTAGGAACTGGGTCTTTGGTCAAAGCCTTGTAATTTATTGCAA 
TACAGACAGATATATTGGGTCTCCTATGGTCCTAAAAATATGAGTATGAAGGATGTGATGGCATTTCACCTATAAGAGAA 
AAGGACTTCAGAAAAATGTGTAGCAATATTTCTTTCTATTCAAAACCTTAGAGAAGAATATTAAGTATAAGAATATCTTC 
ACTTGGCAGATGGGGATAAAGGAGACTAAAAGTTTGCTAAATTCAGTGAAAACTCATCAAAATGCTAGCATTTGTGAGGT 
TAGAGATATCTCCTCCTTCACAGGCTTTTGCGAGAGGTCCGCAATTTTGCTGCATATACAGGAGACACACTGTGCCCTGC 
AGTGAATCACAGCCGCTTGGTAATAGGATGTTTGTGTGGCTTTGCGTGTTTGTATTTGTGCATGTGTGTGAACATACTCA 
CAGATTATCTTGATTAATGTTTTGATATTTGAATTGTTTCCTGTATTTGACAACAATTCTGACAGAAAAACTAGTTGTTG 
TTGTTGTTTTTTCAGTACATAATACGTATCTTTTAAAAGGCTAGAAAAATATTTGGGGAATTATTTTATTATTTTATTTT 
ATTTTGAGACAAGGTCTCACTCTGTTGCCCAGGCTGGAGTACAGTGGCATAATCTCAGCTCACTGCAACCTCCACTTCCC 
AGGCTCAAGAGATTCTCCTCCCTGAGCCTCCCAAGTAGCTGGGACTACAGACGTGTGCCACCACATCTGGCTAATTTTTG 
TATTTTTGGTAGAAATGGGGTTTCACCATGTTGGCCAGGCTGATCTTGAACTCCTGACCTCAAATGATCCACCTGCCTTG 
GTCTCCCAAAGTGCTGGGATTACAGGCATATACCACCGTGCCCGGCCCTATTTTTTATTTTTTGTGGAGACAGGGTCTCA 
CTATGCCGTCTAGGCTGGTCTTGAATTCCTGGGCTCAGAGATTGGGGGCTTAAAAAAATTTTTTTTCATATATTTTTATT 
TAAAATTCATTATGGTTGAATGCTTCCAAAGTTGACTATGCCCCAGGACCTCTAAAAGGACCTATGAAATGTTTGAAGAG 
CACTCACTATATTCCAGGCATGATATTAGTTATGGGACTACAGAGATCTGGCTTGTTCCTCTGTTCTTAATGACAGGACA 
TAGCCCTGCATGGCTGAATAATAGAATGCAGCATATGCACAGAACTTTAAGAATGACTAACTACAGGACTATAAAGATCA 
GAAAAGAGAAATCAGTTTCAGGCTGGAGCATTTGTGGGAGGCGTTATGGAAAAACACCACTTAAATCTTGTCTTGACTAT 
ATACGTGGAAGAAAGGCGCTAGTGGGGAAGAGGGAGCCAGGACAGTGGCCCTGCCTGATCCAAGGTATCACTTTCAGCTG 
CTTTAGTTACCCATGGTTTGAAAGTATTACTTACAATATGATATTCAGAGAGAGAGAGAGAGACCACATTCACATAACTT 
TTATTTCAGTATATTGCTGTAATTGTTGTTAATCTCTTACTCTATCTAATTTGTAAATTAAGCTTTATCATAGGTATGTA 
TGTATAGGAAAAAAACATAATATATGTAGGGTTCGGTACCATCCGCAGTTTCAGGCATCCCCTGGGGGTCTTGGAACCTA 
TCCCCGAGAATGAGGAGGACTGAGTACCACTGAGTGGACCAGCAGGACCACGAGACTCTGTCAGGAATGCCTGTGATGCA 
CCGGGTTCCTCTAAGTCCAGCTGGAGCAGCGTGTTCTCATGAGCATGCTAGGACAAGGAGGAAAGATTGGAAATATGATA 
ATATCAGTAAATGAGGCTCTGAATACAATTTAAATGTTTAAGAAAACTTGTCAGGAGAGTGATTTTCATTTACTTCAGTT 
TGCCrCAATTTTTTCTACTAGTTATAAATCTTGCAATCCAGTATCTCACGGGTATGATCCTTTCCATGACACCACCTAGG 
TAAAATGTACTTACAAAATCCCTACTAAACATGCGTTTCTCATTAGCTCCCAGCCCCAGGAAAAGTTACTCCAAAAAGAA 
CCAGTGCCCAGTAAAACTAGTCCCAGGCAAGAGATACAAGAGGAAATGGACTTTCCTAGGGGTCGCTCCTTTAACAAAGT 
CATTAGAGTTTCAATAATGAGGCTATCATTCTCCCCAGATGGCCTACAGAAATGTTGGATTTTCAAAATCAAATTCACCA 
TTTTAAAGTTGTTCTCCTACCCTTTTAATGGTGATTCCACATGGGAACCAGTCCCACCTTTCTCCCCACTGGCTGGGCTG 
CTGCATTGTTCTAGGAGGCCAGGAGACTGTCCATGCTTCCCAGACTTCACTGTTCTTACACGTCACCTGAGGATTTGTCA 
AAATGCAGATTTTGGCCTGATGCAGTGGCTCCGCCTCTAATCCCAGCACTTTGGGAGGCTGAGGTGGAAGGACTGCTTGA 
GCCCAGGAGTTCAGGACCAGCCTGGGCAACATGGCGAAACTCTGTCTCTTTAAAAAAATACAAAAAAATACAAAAATTTA 
GCCAAGCGTGGTAGTGGTGTGTGCCTGTAGTCCCAGCTACTCGAGAGGCTGAGGTGCGAGGATAGCTTGAGCCCGAGGAG 
GTCAAGGCTGCAATGAGCCATGATCGTGCCACTGCACTCCAGCCTGGGTGACAGAGAAAGACCCTGTCTCAAAAAAAGAA 
AAAGAAAAATGCAGATTCTGATTCAGTAGGTCTGGGGTTGGTCCTGAAGCACTGCATTTCTAACAGGCCCCCAGGTGATG 
CTGCTGCTGCTACACTGGACCAAACTTTGAGTGGAGAGCCTGCAAAGATTGCTTAATTTCCTCCCTTTCACATTTCTCCC 
TATTACCTCTCCCTCCTTCCCTGAGGGCTTAATCTCTTAGGCCATCCTTATCATCATTTACCACTTTTGCCGTCTGCTTC 
CTCCCAAAAAATAAATAAATAAATAAATAATAAAATAAAATTTTAATGGCAAAAAAAGAAGTTGTAAATCTCAATAAAAT 
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TATGTGAGAACAAAAAAAAAAGCAAACATGCTCAAACTGCTGCACCTTGAACCAACTATAGACGTCAGTTCATCCATTCA 
TTCAGCAGTTATTGAGCTCCTACTTTGTGCCAGGCACTGTGCTACATATGTGGAAGGATGAAGTCCCAATATCAGTAGGA 
CAAGGCTACAGACAAAACAGTACTGCTGCTCATTATTCATCTCTATGTGCAAACACAACAGATGGCCTGCCCTCCACTTC 
ATTCTACAGAGATCAGAAGTCGTGGGTGAGAGTGTGCTGATGGTGCACAACTGTGTAAATTACTAAAACTCATCAAACTC 
TAAACATAAAATGGGTGAATGTTACGGTATGTAAATCATACCCrGAATACAGATAGTAGATGAGGTTCCTCTGTTGCTTA 
ATTGCTTCTCAAGTCCAATCTGGAAGTTACAGCTTTTAATGAATAGAGTAACTATTTTTTCACGTTGTTTGTGCTGTGTA 
TTTTCTCCTCCACTCTGCTAACAGACAGCAGAGGTGACGCAGAAGTGGAAAATATTTGGGGCTTGAACAGCTCTCAGGTT 
TCCCCTTTAGTAAGCTCCAGCTTCTCAGCAGACTTGGGCTGTAGATCGATAAGCCCAGATTTCCAAGGTGACCATAAATG 
AGTCTCTGTTGCTGAGAGCTAATAGCGGAAACATTTGCTTGCCAGGTCCACTGCGTCTAGTGCCACCACGGGGCAGCGAC 
CTTTCTGACTCAGCTGTAGTGGAAGCAGAAACAGCCATAAAGAATCCTGGCAGCCTGATTTGCTGCAGCCAGTACTCATC 
CAGCCAGTCCTGCAACTCTTCAAACTGTTACCAAGCTGGGACCTCAATCAGCTTCTGTCTTTTCCTGCAATAATAAAAAA 
CATCTCGGGTTCAGAAGCCAAGACAAGAAGATAGAGAGATACAATCTTCTATCTTCCTCTTCTGCTCCCCCAAACACCCC 
GACCATTTAGATTTTTAAATTTTCTTCTTTATACTTATCTGGACTTTCTCATTCTTATAAAGACTAAGTGATGTTAAATT 
TGTTTTTAATTTAAATAAAAGATAGTAGTACTCACTGGCACCAGTTACAGCTTGCCTTTAAGAGAAGTAGTTTCAGATAC 
ACCCTGAAAGGGTTCTGCAGCATATATGTGGTCATAGGCTCAGAAAACAGGCTGTTGAGTGGTGGCGTTTTTAAACTGGA 
GTTGGGGTCTATGAAGAGGAAGGAGAAGGAATTTGTCCGAGTAGCCAAAGGACAACCAAATGTGTAGAGTGTAGGTGGAA 
AAAGAAGCAGTAGTTTTAACTTGAGACCAAGGCCATATGCCTGGCTTATAGCTGGAAATGGGGAAATGGCTTTCCTAGGC 
AGTATATGTGGCGTTGGGGTTGGGAATATGGGCACTCAAGCCAGATTGCCTGAGTTCAGATCCCATTCTGCCTCAACTAG 
ATGTGTGACCCTGAGAAGTTACCTACCCTCGGTTTCCCCAGATGTAAAATCGGAATAATAATTGTACCTCTCCCTTGCAG 
TTAGTGACAGAATAAAATGAGTTAATACATGGAACTTAGAATAAGACTTCATACATACTAAGGGATCAGTAAGTGTAACA 
TTGTTCAGTGGGGCAAATAGGGGACTGATGGATTTGAGTGGGAAATAGAGAATTAATCTGACTTAAATACGGAGATTGTC 
TATCCATGATTTGTCTGTCTCTATAAAGTTTGAATCATAAGACACAGTGATGCTGATGAGACATTGGCCTGGGAGCAGCA 
GGATTCTGGGTTTATATCCAGCTGTGCTGTCCCACAGGTATGTGACTGGACAGGGCACTTCACCTCTTTGCATGTTAGTT 
TCATCAACTATGAAATAAAGAGACTAGAATACAGCATCTCTAATAGTTTATCATTCTCATATTGTACAAATAGTTCATTT 
ACTTAGCCTGGGTCTGTCAGGCATAATAACGCTACCATGTGCTCTGGCTTCAGCTGTGTGCAGGGACTCTTCTGAACATT 
TGATATGTTTCAACTAATTTAATCTTTACATTAATTTATGAGGTAGGCTCTTATCACCCACACATCACAGATGAAGAAAC 
TATGACATGAAGAGGTTAAGTAGCTTGTTTAAGGTTGCAAAGCCAGTAAGCAGCAAAGCGGGATTCAGAGTTGAGCACTC 
TGGCTCCAGAGTCCATCCTCTTAATTGCCATGCTGAGCTGTTCCCTCTAGTGACTATATTCAGTTGCTAGTAACAGAAGG 
AAGAGTAGCTTAAATAAGAAATTTATTTTTCTCTCACATTAAATAAGATTGGAGGTAGTCGATGTAGAGCTGTGTAGTGG 
CCTCATAAAGTCATCAGAGACCCTGGTTCTTTTCCAATCCTTTGCCATGCCATCCTGGTTCTAGTGTACCCATTCTCGTG 
GTCATGATATGGTTGCTAGGGCTCCAGCCATCATGACCACATCTAGGCAAGTCAGGAGTAGAAATGAGGAAACAGCAAAA 
AGATGTGCCCATTTCCCAGTGCCTTCACCTATATTATCAGCGATCCCTACCTGCATGGGAGGCTAGGAAGTGTAAGTTTT 
CAGGTGGTCACACTGCCTGGAGTTCTGCCAGTAGGGAAGAAAGAATGGATATTGAGAAAACAACTAACGAATGTTTGTCT 
GCCACACTGAGGAACCCATGTATGGGCTGTGCTGAAAAAGGGGGGCCAAGGCTGGGTACAGTGGCTACGCCTGTAATCCC 
AGTACTTTGGGAGGCTGAGGTGGGCGGATCACTTGAGCTCACAAGTTCGAGACCAGCCTGGGCAACATGGCAAAACCTCG 
TCTTTACAAAAAATACAAAAAAAATTAACCGGGTGTAGTGGCGTGCCTGTAGTTCCAACTGCTCGGGAAGCTGAGGTGGG 
AGGATCACTTGAGCCCAGGAGGCAGAGGTTGCAGTGAGCTGAGATCATGCCACTGCCTTCCACCCTGCATGACAGAATGA 
GATCCTCTCTCTAAAAATAGAGGGGGTACCAAGAGATGCAGGGGGGGTGAGGGCAGCATGACTACTCTCTCTGTAGGAGA 
CCTTAACTCTATAAATGGAGGCCCCAAAATGTTACTGCCATCAAAAGCCAGGAATCCTTTTCTGGAGGCGTAACTTCCTG 
CCCTTTCTAATCCCTATCAATCTGGTTTCTGTAGAACTGTGACTGCTAGAAAACCCCAGGCATATTTGTTCTAAGAAAAT 
ACTTGTGTTCGGTGAATTTACCAACAAAGGGAGCATCAGAGGATGTGAGGGAAGTCTGGAATGGTTGTATCACTAAGTGA 
GAGCAGCACAGATGTTTGTGGACCTATTGAGAATGTTACAGATAAGACCATTTTTGAAAAGTTGTTTGCAGTGTCATTTT 
ATGATCTTGTGTACATTTTCCAAGCGATGTGGCTATTCTCTAGGAGGGATAGTAGAAATTATTTCAATTTTAATCAAATA 
ACCTAGAGAATATAACCCAAATGACTGAAAGGAAGAAATGTAGAAAAAGTATATAAAATAATTTTTTGCATTATAAAAGT 
TTAAAGACATAAAGTAATATTACTACATAAAATCTAAGTTTTTTACTCCAGCTATTAATATGTTTTTCTTTATAAAACAT 
CACATTTATTAATTGCTGTGTAACAAACTACCTCACAATTTAGTGGCTTAAAAGAAAATTTAATTATTATGCATGTGGTA 
CATAATAATTTTTGCTTTCCTCATTTCTACTCCTGATACTTGCCTATGATGTGGTCATGATGGCTGGGGCCCTAGCGAGG 
TGTATTGTGGCCATGAGAATGGTTTTGCTGCAACTTGGGGTTGGCTGGGCTCAGCTAAGCAGTTTTTGCCTGGAGTCTCT 
CAGTTGCACTGAGACAGTGACTAGGACTTGAATTACCTGAATGCTTCCTCACTCATGTCTGGTGTCTGGGCTGAGAATAC 
TCAAACAGCAGGGGCTCCTCCTCTCTCTAACTCTCCTTCTCTCCACATGGTCTCTCCTGCATGACAGCTTTAGGGTAACT 
AGACTTGTCACAGGATGGTTCAGAACTCTTATAGTACAGAACGAGAGAGAGACAGAGGCCATATTACCTTTTATTAGCCA 
GTCACACAGTACCACCTCCACCACATCCTATTCAGTGTGGCTGTCACAAAGAACCATCCACCTCTTGATGGGGGCATGGG 
CAAGGTTCTAGGAGAGCACATTCTTGTGGCCATTTTCAGAAATACAATCTGCTACACTCCAGAAGCATCTGTTTGCCTTA 
TTATCATAGCTTCTGATTTAATTCTCCCAGCAACCCTAGGAGATTATATGAGCTTATTATCCCCATTTTCCAGAGAAAAT 
TGAAGATTTTTATGGTTAAGTCATCCTGCTAATGAGTAGCAGCTTGTTTCAACCGCAGTTGTCTGCATATAACACATTTA 
ACTCAGTCAAGGGCACATGGTGAGCAATCAGAAGGTGTTAGCTAGCTGCTGCTACTTCTGCAAACAAAGACTAAGGAGAG 
TTTAGCTGATTTTACCAAAGTTGTCCGAGTAGACTTCCCCAAATTAGCTGTTAGCTTTGGCAGGACTCATCCCACCTCCT 
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GTAGCCTGCCAATGTTATAAACTTGGGCCTTGAAGTCATCCTGAAACTGATCTTATTTTACACATGGTGGTAATTGTAAC 
TGTTAGTTAAGAAGCAATAAAAAAATAATAAGGGGTGATATAGATTTCTCAGGATTCAAAGGCCATATGCCTTCCCTTCT 
ATTTCTTCCCTAAAACCGTTAATTCATAACCTTAGCTATCCTATAATTTGCTTCAATTAATGACAAACATAAAAGATAGG 
TGAAAATCCAGTTAAGGGAGTTCTTTCAGATTCTGTGTGAGTTAAATCAAGTCCTTCTTCTATAGCAGGCTAATTCCATG 
ATGAAGCTGTGGTGCTAAGTTTATTTTTGCCTAGGCAGGTAGCTCTTCCTTGCCACATGCTATCCTCTTCCTCTTTGACT 
GTCATCCTGAAAATGTGTTCAGTGTTAAGAGGATTTGATAACAATGTCTGGGTATCTCCATAACTTAGTCCAAATCTGGT 
ACCACTGTTAGGAAAATAATGTAAGCCATATTGATCACCCACGTCTTATTCATTTGAAGGATAATATAGTTATTACTAAT 
TATTGCGTGTTAGGTAACATACAGCCATTATTCTGTCCCTTTAAGGTTTATTGATTTGTAAGTTCCCTTGGGTAGAAGAG 
TACAAAACAAAAGGCAAAACTTCATCATGTTTAAATAACGTGATTTTCTTCTTTGGTGTGTACCGTACCGTGCAGGAGAC 
AGGGAAAACACCAGGTTTAACGGTCCACAAGCCAGATAACCATGATGAGTTCTAAAGATTTTCTCTGTCTTTAATGTTCT 
TCAGTTTTGTCATGAGAAGACTGGATATAGCTCTGTTGTTGTTCATTAGAGGATTTATATCTCTCTTCAACTCTAGAAAA 
TTACCAGCTGTTATCTCTTTAAATATTGTCTCCCTCTCATTTTCCATTCCCTTGGAGATTCTATTAGATGTGGGTTGGAT 
CTTCCTTATATCTCTATGTTTCTTAACCAATTTTTTCATATTTTTTTATTTTTTAAATCTCATTGGTGATTTCCTTGGAT 
CTGTCATACATTTACTTCTCCAATTATGTCTAGTCTATTATTTAACCTTTAACCTTTCCATTGAGTTTTTTGTCTTTTAG 
CAACTGCATTTTTTTTTTTTTTTTTTTTTTTTTTTTTGGTGAGACAAGGTCTGGTTCTGTCACCCAGGCTGGAGTGCAGT 
GGCATGATCTTGGCTCACTGCAACCTCCACCTCCCAGGCTCAAGCCATCTTCCCACCTCAGCTTCCCAAATAGCTGGGAC 
TGCAGTCACCTGCCACCATACCCAGCTAATTTTTATATTTTTTGTAGAGATGGGGTTTTGCCATGTTGCCCAGTCTAGTC 
TTTAACTTGTGAGCTCAGGCAATCCACCCACCTTAGCTTCCCAAAGTGCTGGGATTACAGGCATGAGGCACCACACCCAG 
CCTAGTGACTGCATTTTTCTTTTTTCTAGAAGTTCTGTTTGGTCCTTTTCTGAACTTTTCCTGGTCTCTTTTTTCATTTT 
CCCTGTTTATTTGTTTTTATCATGGATTTTATACCTTTTATTTATTTTCTTTTTTTTTTTCTAGTTACATGATTACTTAG 
GATTTTATACATTTTATTACCTCTCTATATTAACTTTGGTTTTTTACATTGTTTTATTATCTCTAATTCTTATTGGACTA 
ATTCTTTTGTTTGATGCATCTGTTAGTTTTCCCTCGTGGTGGTTGGTTTCTTCATATGGTTTGTAATTTTTTATTGTGAG 
CTCATCTTTGGCGAGAGTGGCTCATATGCCCTGATTGAGAATGTGTTCCTCCAGAACAACTTTATGTTGGTTTGGCTGAA 
TCCTAGCAATTTCAGTAATCTTGGACTGGCTTTTAAGTTATTTTCTCACCTTGAAGCACATACAGTCMGGAATGTACAT 
TTGTAACTTATACTATGTGTGGTGCAAGCCTAGAATTTCCATTTCTCAATATGACTTTCTTTTCCATAAATGGCCCTAAG 
CTGATAGCAAGTTTTCATTCTGCCTCTCTGGACATCTTGCAGCATTTTTCTAAACCCTCTTTCATAGATGGGATAGCTTT 
TCAAGGCTCTGGACGATATGCAGGrGGCATATCCAGAATCTCCCTTTGCCCTGGCCAAGGCCACATCACTAATCTTGTGT 
GAGCTTTGAAGCCCCTTCCCCTCAGCCCATAGACCTATACACAGTCCTAAAATCTTAATGGGCAGTCCTACTGACAGCTG 
CCTTGTCACCAGCTCCTGTGATCATTCTAGCTTTGATTTTTCTCTTTCTTTCTGGCACCTGGTGTTCCCCTTTCTCTTTC 
AACAAGTTTTTGCTGTGCTTTCCTAGCACTTCCATATGTACGTAGCAGGAGGAGGCTGAATGCCATCTGCTCTGTCTGCC 
ATGTTGCTGTAAATCACAGTGAGTTTTTTGTAAGTGTAACAGCTTCCATTCTGCAGTGTGTTTTGAGTCTGACTCTTAGA 
TCCATCACTTCCTCACAGTGTTATCTTGGGCAAGCATCCTAACATTTCTGAGTTTCAGAAACAAAATAGAGATAAATGCT 
GACTTTTTAGGGTTGTTGGGAAAATTAAACAGATAATGCATGTAAAACTTCTTGAAACTTCTTCTGGCACATAGCAAGTC 
AGAGGTTCTCAATTCTGGCTAGGTTGGCTTCAGTATCCCCTGGGCAACTTTTTAACATTAGACATTTCTGGGCCAGAAGC 
AATGGCCCACACTTGTAGTCCCAGCTACCCAGGAGGCTGAGGTAGGAGGATCACTGGGGCCCAGGAGGTCGAAGTTTGCC 
ATGAGCTGTGATCATGCCACTGCACTCCAGCCTGGGTGATAGAGTGAGCCCCGTCTCAAAAAATAATAAAATAAGGTAAA 
AAAAAAAAACAAACACAGAGATTTCTGGGTGCTACCCCTCAGCAGTTATGATTCATTAAGTCTGAGATAGATCCCAGAAA 
TCTGCATTTTGAAAAGCTCCACAGGTGATCCCGATATGCCACCCAGTTTGAAAACGTTCTTAAAGTTATTCGAAAAATCG 
TAAGTAATTTCATTGTTCTCAGATTTCTAAGCACTTCAAAGTCATTTATTTCTCCCACACTGATATTTTCATCTCAGATG 
TGGTGAAGCTGTAGAGAAAAACAAGCGTCTCATCACGGCAGACCAGAGGGAATATCAGCAGGAACTCAAAAAGAACTATA 
ACAAGCTAAAAGAGAACCTCAGGCCAATGATCGAGCGGAAAATTCCAGAACTGTACAAGCCAATATTCAGAGTTGAGAGT 
CAAAAGAGGTAAGAACAGGGCAGAGGAGGCCTCTTCCTGTGGGATAAAGAGCAGCGCATGGGGCCTAGCACCTTGGGGCA 
TGCTCTGCTGCACTTGGGGAGCTGCAGAACCTCGAAAGGGTGGAAGAGGGTCCCACAGTCAGAGAGGCTACCAGAGTGTG 
ATTCATTCTGCCTCTGTCCTCCCCATCCCTGCTCCTTGACCTCTCCCAGACACCTTGGTGTTGGTCTTGTGCCAGGGTAT 
TCCCAGGGCTGAATGATGGCCTGTGTTGGTTTTTTTGTTTGTTTGGTTGGTTTGTATGTTTGTTTTGGCACAGTGTGAAG 
GGGTCGCAGACCTTTCACTTATTATTTGCTGAGTTGTCCATGACTGATGTCCATTTCTACTGGGTGATCCACCCCCAACC 
CTTTCTAAAAGGCTAACTGATCTTTTCTTGCTTCTGTACGCTCTCTTTCCCTCTCCTCCCTCTCTTrTCTTAATTTCAGG 
GACTCCTTCCACAGATCTAGTTTCAGGAAATGTGAAACCCAGTTGTCACAGGGCAGCTAAGAAAAGCCATCTTCATTCGT 
GGAGACTGTGGCCCTGCAACCCTGGAGAAGGACTTGCTGGTACTTAAAAAATGGGACATTTGCCACCCAGGACTGACTGT 
ACACTCCCTGATCAGCCAGCACTCTGGAAGCTTTGGGATCCCAGGAACCATGGAATTATTCCCAAATGGACTCTGACCAG 
ATTTTTGCCATACTGGGGGGTGGCGGGATGGAGGATGGGTACTCAGGCATGACTGCGTATTTATTAAAGTGTGTTTTTCC 
ACAATGTACCAAACAAGGCATAAGCAGCTTCTCCTGCTGACTGGCCAATCACTGCCCATCTGAGAGATGATTTCCTCTGG 
CCCATATTTGAATTTATTGGAGTAACTCAAATTGCCTGAGGAAAAATGGAAAAATTATCCACCAGTCGATTCAAACTGAA 
TTTCACTCTTTATAGGAAGGCAGGGCAAACTTGTAGGAGTACGAAACATTTTCAATAAATCTACAAAGGGAAGCCTTACT 
ACAATTCCAAAAATCATCATGGTTGGAAATTTGGGAGGAGATTATTTGTGAACTTGTTACCCTTTTGGTAATGGTGGACT 
AATTGCTGTATAGTTATTTTTGTTTTATTATTACTGTTACATTAATTTAACATGCATTTATAGAAGAATACATTCAAAGC 
ACTGATGTAGGAGATACACGGTACTTGGAGCAGTCAGCCAGAAATCACAGATACTGCTTTCACTTAAATGGAAACAATTC 
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TCCGATAATGCTTTGCTTTTTTTCTTATGTCACTCTTGTGTACTATCTATTTTTCTCCTCTCTGGGACCAAGTTTCTTTT 
TATAAAGCAATAATATCTCTGTTTTCATTTCAGAACATTGTGCTGTCTGTCAGCATATGTATATCAGCTACAAAATATAT 
TCAACTTTGACTTCTTTTGACAAAGGACTTTAGGAAAAAGAGGAACAAAGACATTATTTGAGAATTAAATTATATATTTT 
TAATATGACTGTGACCTTGACTGATAATAAAGATGTAATAAGAATTGCAAGCTAAATGTTTCCCTTTGCAACTCATGCTT 
TGTGTTTTGTTTTGATGACCTACTCGCTCGTAATGTTTTGTAAGGCACTTCAGAGAGAAGACAGATGCATCATCCTGGCC 
TCCATCAAATAACACTATCCAAGGTGGCACCTCTTCTGCT^TGTTTAACCCTGCTAGTAATGAACGATGACTTAGTTCGG 
ATATTTCAGAACTTTTTGTTTATACCATCAGGTATGCATGAATTTATAATCTGAAAGAGGACTTAAAATAATAATTAAAA 
CTTACCAGCTTAAGTGCTAAACTTTTTATTTTTTAGGTATTTGGGGAAGAACTCTTTTTAAAGTATACACCTAACTGCTT 
TTTAAAATGAGTACACATGACATACTTTAATTCCATATGTATTCCCCTACTCTTTGGGAGACACTGTGTTGAGACCAAGG 
TCAAAAAACGTGGTCACCGCCCTCCAAATCGTCTCCGTTCCCTGAGGAAGATCATATACCTGTGTAGTAGCCACAGTACA 
AAACAGACTAGAACACAGCCCATAGCATGTAACCTTTTCCTGACTAACTCAAGGATAGGCCAACACCTATGGTATTAGAT 
TCTGCCCTAAAACAATAAGAGTTAGATGCTAAGTTATATAGTCCTGGACCTTAACTCAAATAGCCAGAATAGCCCTAGTA 
ACCTAGAATATTCCTGATTAAATATCCCCTGCTTTTAGATACCTGTTGTCCATTTGGGTTTGTTTTTTACAGTCTCTTTT 
GTACCACAGTGGATACATTTGCTTCATGAGTGCAGGAACCATGTTCACTGCTGCATTCTTACCCCTAGCCCTGCAACAAA 
CACACAAAAGATACCCAATAAATATTTGTTGATTCACTAAATGAATGAATGATGAGTAGGCCTGCTTCTAGAAGTGCACT 
GCCAATAAGAATGTAATGCAAGCCACATATATAATTTTAAAAATTCTAGTAGCCATATTAAAAATAATAATAGGCCAAGT 
GCAGTGGCTCATACATGTAATACCAGCAGTTTGGAAGACCAAGGTGGGCAGATCACTTGAGCCCAGGAGTTTGAGACCAG 
CCTGGGCAACATGGCTAAACCCCATCTCTACCAAAAAAGATATAAAAAATTAACCAAGTGTGGTGGCATGTGCCTGTAGT 
CCCAGCTACTCGGGAGGCTAAGGTGGGAGGATCGCTTGAGCCCAGAAGGTTGAGGCTGCAGTGAGCCATGATCGTGTCAC 
TGCACTCTAGCCTGGGTGACAGAGTGAGACCCTGTCTCAAAAAATAATCAGCATCATAAAAAGAAACCAGCAAAATTAAC 
TTTACTAGTATATTTAACCCAATATATATAAAATATTATTTCAATATGCTTCCACTATAAAAAATTATTTTACAGTCTTT 
TATTTCCATATTAAGTCTTTAAAAATCTGATGTGTAGTTTGTACTTACAGCACGTTGCAGTTAGGACTGGCCACATTTTA 
AGTGCACAGTAGCCACAGGGGGCCACTGGCTACCATATTGGATAGTGCCATTCTAGAAGCTTTCAGCTTTTTCAACTGGA 
TGCCTCTGATTTGTGGACTCAGAATACAGATAACCAAAGAAGTGGGACTAGTGTCTGAAGTAAGAATGACAGGGTATGAT 
TGAGAGCCCCATGAGCTTACCTAGGAGAGAAACTTGTGGGGTTGCAGAATAAGGATTTGTCAATATTGGCTCTAGCTGTT 
CACACTATTTCTGGGCCAACTCCCAGATCATTTCTCAACTCCAGATAGTTAAGTGGGGAGCATGGCTGCACTTTTTAAAG 
TGATGGCACAAAAAAAGATATTGAACGTTGGTCCTCTGATTATATATTCTAAATATGCAGTTAGAAAAGAGGCCTTTTAA 
GAATCCCTAAGAGTAAAGCAAATTAGTATCTTTGTTTCCTGAAAATTAGAGAAACTTGATATGCCATGATAGCCCTCTTC 
ATTTTATTTGGAAAACTCTTCTATGAAAGCTTATTAGAGAAAATTTTCTGATTAGCTTCATGCCTCCCCTCCTTCAGCAA 
GGTCAAGGGTGCAGTTGTCACTATCACATAAGAATCTCATAAAAATTAAACATGAATATACTGCACAGATCTGATTGGGT 
TTGTCATGCCACACATTGTTTTAAATTCCATAATTCTATTCTATAAAGAGTGGTTTCTATGACAATAGATCGTTTTAAAA 
ACAAACAAACAAACAAAATTTAGAGTTGTCATTGGTAATTGTGGTTGCAAGTATGCTTTCAAAGACCAGAAGCTTTTGTT 
TTGCTTTGAATGTAATTTTTTTCTTTTTTCTTTTTGATACGGAGTCTCACTCTGTTGCCCAGGCTGGAGTGCATTGGCAC 
CATCTCAGCTCACTGCAACCTCCACCTCCGTGGTTCAAGCAATTCTCCTGCCTCAGCCTCCTGAGTAGCTGGGATTACAG 
GCGTCCACCACCACGCCTGACTAATTTTTGTATTTTTAGTAGAGATGGGGTTTTACCATGTTGGCCAAGCTGGTCTCAAA 
TTCCTGACCTCAGGTGATCCACCTGCCTCTGCCTCCCAAAGTGCTG 
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GGAACAATTTCCTCTCATGTGTATGGCTCCCTAAAGTGTTGGCTGAGCATTGTCCACATGGGTG 
ATGCAAAGGATCACTGAACTAGGAGCAGTTGGGAAAAAATACAATCATTGGGAATTCCTGTAGC 
ATCGAATGTGCCTACAGGGAGGTAGAAGTATTCATACAACAGTTCTCTGGTGTTCTCTGTTGTA 
GCAACCAGTCAGCCAAAAGGGTTCAGCTGCTTGAAATGAGAATGGCTGGATCAAAATGGCAGCT 
CATGATTTAAAGGATTCTAGTCAGATACCAGACATCCTCACATAGAGAAAACTCTGAATGGCTG 
GGGGAGAAGGAGTCAAATGCCCTGGATCTTTTTCTTGGGCCTCAAAGTCCTCCTTCTGTCATCA 
TCCTTCCAGTATTGGGCAGGACCTGACTGCAGGCATCATGGCCTCTGTGAACTTCTCZ^GGGTA 
TGTATTATCTGACAAAAACTACGATGTCCACTAACAGGCCACTGAAAGGTATCTTAGTCAGTTC 
TGCTCATTGCCCAGCCAAGGCCTACGTTTTATAACATGATATCAAAGATTGCATCTAAAATTGT 
GATGATTTCCTAAAATAATCATTTCATTTAGATTTTTCTATTTTAATCCAAGGTATTCTTCAGC 
GGAAATAAGGAAACAGTTTACTCTCCCACCAAACCTTGGCCAGTACCATCGACAGAGCATAAGT 
ACCTCTGGCTTCCCCTCTCTTCAACTAGTAAGTATGAGTTCCAGGTTTACTTAGCGATTGGTCA 
AGTGCAAAAGTGCCCAGGGTATGTGTTTGCCTCCTGTTCCTTAGATCTTCCTACCATCACCTCA 
CATTCTCCAGTCACCAGATCCTAACTCTGTGACTGTGTCTGGACATCAGACAATATCCCTCTCT 
CTCTCTGCCAACCGGTACTTAGGGTACATAATAGAACCTCTGGGAGCTGTGGTTTTGATGTCTC 
TAGACTAGGTGGGCTTCCAGGTGACTCAGTCTCATCCAAATTATGGTTCATATTTGGGGGAGAA 
GGGCTAGCCCAAAAACTTACCACCATTTGrAGTATGCATTTTTTTGGAAAAGCATATTCCAAAA 
TCTGAAATGCCAAGTTACAGACCTCCTTTTTGTAAAATAATTTTCTTGCTAGTATAATTTACAT 
ATAATAAAATTCACACATTTTAGGTGTACAATTTGGTGAACTTGGGCAACTTAGAGTCACTTAA 
CCTTTCCTCAGTCAAGATATAGAACACTTCTTTTATCCTAAAGCGTTCCCCAGCGCGCTTTTAC 
AATCTCCTCTCCCCAGGCCACACCCTCCAACTCACGCAATCTCTGACTCACTTCTGTCACCATA 
ATTTTGCTCTATCTGGAGCTTCATATCCTGTTACAGTATGTACAAACCTTCTTTTTTTGAGACA 
GGGTGTCAGTCTGTCACCCAGCCTGGAGTACAGAGGTGTGATCTCAGCTCACTGCAACCTCAAC 
CTCCCAGGATCAGATGATTCTCCTCCCACCTCATCCTCCCAAGTAGCCGGGACTACAGGCGCAT 
GCCACCACACCTGGCTAATTTTTGTACTTTTTGTAGAGACAGGGGTCTCGCTATGTTGCCCAGG 
CTGGTCTTGAACTCCTGGGCTCAAGCGATCCTCCTGCCTCAGCCTCCCAAAGTGCTGGGATTAC 
AGTGAGCCACTGCACCTGGCCCTAAACCTTCATTTTTAAAACACATTTCCTCTTAAATTGAAGA 
TTGCCTACATTTTTATATCAATGCCAATTGTTGAGTGTGCCTATATGTGTTATATTATTTGAGC 
ACTAAATGCCAGATGTGTGCCAAGTGAGATAAATCTGACAAATGAGATGGTTTGTAAAACCAGC 
AGTGAATATTCACTTCCTCTGTGAGAGAGCTCCAGCCCTCCTGTACTCACTTCCTCACACAGCA 
CAGCAGCACTCTTGCTGGTTCTGCTGCTTATCTTGAAGAGGTTAGGTTACTTTTTGTTTCTACT 
TATTACTTCGAAACCACTTCTGCCTTAGAAATTTTGTAACCTTCCGCTCAGTTTC CGGTAACCG 
CCATTTTGTCTCCTGTAACAATTTACGCGCCGTGTAACTGTGAATCTTT 
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MFPMEDISISVIGRQRRTVQ 20 

MTHLNSLDVQLAQELG 16 

MAERRAF AQK I S RT VAAE VRKQ I S GQ Y S G S PQLLKNLN I VG 41 

MLLFPYDDFQTAILRRQGRYICS 23 

MAASERRAFAHKINRTVAAEVRKQVSRERSGSPHSSRRCSSSL 43 

MSFRGKVFKREPSEFWKKRRTVRRVIQEEFHRFSSQEKPRLLEPLDYETVIEELEKTYRN 60 



STVPEDAEKRAQSLFVKECIKTYSTDWHWNYK 53 

DFT 19 

N ISHHTTVPLTEAVDPVDLEDYLITHPLAVDSGPLRDLIEFP 83 

TVPAKAEEEAQSLFVTECIKTYNSDWHLVNYK 55 

G VPLTEWEPLDFEDVLLSRPPDAEPGPLRDLVEFP 7 9 

DPLQDLLFFPSDDFSAATVSWDIRTLYSTVPEDAEHKAENLLVKEACKFYSSQWHVVNYK 120 



YEDFSGDFRMLPCKSLRPEKIPNHVFEIDEDCEKDED SSSLCSQKGGVIKQG 105 

DDDLDWFTPKECRTLQP-SLPEEGVELDPHVR DCVQTYIREWLI 63 

PDDIEWYSPRDCRTLVS-AVPEE-SEMDPHVR DCIRS YTEDWAI 126 

YEDYSGEFRQLPNKWKLDKLPVHVYEVDEEVDKDED AASLGSQKGGITKHG 107 

ADDLELLLQPRECRTTEP-GIPKD-EKLDAQVR AAVEMYIEDWVI 122 



YEQYSGDIRQLPRAEYKPEKLPSHSFEIDHEDADKDEDTTSHSSSKGGGGAGGTGVFKSG 18 0 

WLHKANVNSTIT — VTMKVFKRRYFYLTQLPDGSYILWS YKDEKNSKESK-GCIYLDACI 162 

VNRKNQGSPEIC — GFKKTGSRKDFHKT-LPKQTFESETLECSEPAAQA — GPRHLNVLC 118 

VIRKYHKLGTGF — NPNTLDKQKERQKG-LPKQVFESDEAPDGNSYQDDQDDLKRRSMSI 183 

WLYKGNMNSAIS— VTMRSFKRRFFHLIQLGDGSYNLNFYKDEKISKEPK-GSIFLDSCM 164 

VHRRYQYLSAAY — SPVTTDTQRERQKG-LPRQVFEQDASGDERSGPEDSNDSRRGSGSP 17 9 

WLYKGNFNSTVNNTVTVRSFKKRYFQLTQLPDNSYIMNFYKDEKISKEPK-GCIFLDSCT 23 9 



DVVQCPKMRRHAFELKMLDKYSHYLAAETEQEMEEWLITLKKIIQINTDSLVQEKKETVE 222 



DVSGKGPVTACDFDLRSLQPDKRLENLLQQVSAEDFEKQNEEARRTN RQAE 169 

DDT PRGSWACS I FDLKNSLPDALLPNLLDRT PNEE I DRQNDDQRKSN RHKE 234 

GVVQNNKVRRFAFELKMQDKSSYLLAADSEVEMEEWITILNKILQLN FEAAMQEK 219 

EDTPRSSGASSIFDLRNLAADSLLPSLLERAAPEDVDRRNETLRRQH RPPA 230 



GWQNNRLRKYAFELKMNDLTYFVLAAETESDMDEWIHTLNRILQISPEGPLQGRRSTEL 2 99 
^ : * : . : : . : 

TAQDDETSS QGKAENIMASLERSMHPELMKYGRETEQLNKLSRGDGRQNLFSFDSE 278 

LFALYPSVD EEDAVEIRPVPECPKEHLG N RILVKLLTLKFEIE 212 

LFALHPSPD EEEPIERLSVPDIPKEHFG QRLLVKCLSLKFEIE 277 

RNGDSHEDD EQSKLEGSGSGLDSYLPELAKSAREAEIK LKSESRVKLFYLDPD 272 

LLTLYPAPD EDEAVERCSRPEPPREHFG QRILVKCLSLKFEIE 273 

TDLGLDSLDNSVTCECTPEETDSSENNLHADFAKYLTETEDTVKTTRNMERLNLFSLDPD 359 



VQRLDFS GIEPDIKP-FEEKCNKRFLVNCHDLTFNILGQIGDNAKGPPTNVEPFFI 333 

lEPLFAS lALYDVKERKKISSNFHCDLNSDQFKGFLRAHTPSVAASSQARSAVFSV 2 68 

lEPIFAS LALYDVKEKKKISENFYFDLNSEQMKGLLRPHVPPAAITTLARSAIFSI 333 

AQKLDFS SAEPEVKS-FEEKFGKRILVKCNDLSFNLQCCVAENEEGPTTNVEPFFV 327 

lEPIFGI LALYDVREKKKISENFYFDLNSDSMKGLLRAHGTHPAISTLARSAIFSV 329 

IDTLKLQKKDLLEPESVIKPFEEKAAKRIMIICKALNSNLQGCVTSNENDPITNIEPFFV 419 
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NLALFDVKNNCKISADFHVDLNPPSVREMLWGSSTQLASDGSP KGSSPESYIHGIAE 390 

TYPSSDIYLWKIEKVLQQGD IGDCAEPYTVIKESDG GKSKE-KIEKLKL 317 

TYPSQDVFLVIKLEKVLQQGD IGECAEPYMIFKEADA TKNKE-KLEKLKS 382 

TLSLFDIKYNRKISADFHVDLNHFSVRQMLATTSPALMNGS GQSPSVLKGILHE 381 

TYPSPDIFLVIKLEKVLQQGD ISECCEPYMVLKEVDT AKNKE-KLEKLRL 378 

SVALYDLRDSRKISADFHVDLNHAAVRQMLLGASVALENGNIDTITPRQSEEPHIKGLPE 47 9 
* : : : . ... : 

SQLRYIQQGIFSVTNPHPEXFLVARIEKVLQGNITHCAEPYIKISrSDPVKTAQKVHRTAKQ 4 50 

QAESFCQR LGKYRMPFAWAPISLSSFFNVSTLEREVTDVDSWGRSPVGERRTLA 372 

QADQFCQR LGKYRMPFAWTAIHLMNIVSSAGSLERDSTEVEISTGERKGSWSERR 437 

AAMQYPKQGIFSVTCPHPDIFLVARIEKVLQGSITHCAEPYMKSSDSSKVAQKVLKNAKQ 441 

AAEQFCTR LGRYRMPFAWTAVHLANIVSSAGQLDRDSD SEGERRPAWTDRR 429 

EWLKFPKQAVFSVSNPHSEIVLVAKIEKVLMGNIASGAEPYIKNPDSNKYAQKILKSNRQ 539 



VC S RLGQ YRMPFAVfA ARP I 

QSRRLSERALSLEENGVGSNFKTS TLS 

NS S I VGRRS LERT T S GDD ACNLT SFR- 



FKDTQGSLDLDGRFSPLYKQDSSKLSS 



SEDILKLLSEYKKPE 
VS SFFKQEGDRL S DEDLFKFL ADYKRS S 
PATLTVTNFFKQEGDRLSDEDLYKFLADMRRPS 



ACQRLGQYRMPFAWAARTLFKDASGNLDKNARFSAIYRQDSNKLSNDDMLKLLADFRKPE 

RRGPQ — DRASSGDDACSFSGFR-PATLTVTNFFKQEAERLSDEDLFKFLADMRRPS 

FCSKLGKYRRAFAWAVRSVFKDNQGlSrVDRDSRFSPLFRQESSKISTEDLVKLVSDYRRAD 



510 
427 
496 
501 
483 
599 



— KTKLQIIPGQLNITVECVPVDLSNCITSSYVPLKPFE-KNCQNITVEVEEFVPEMTKY 5 67 

SLQRRVKSIPGLLRLEISTAPEIINCCLTPEMLPVKPFP-ENRTRPHKEILEFP — TREV 4 84 

SVLRRLRPITAQLKIDISPAPENPHYCLTPELLQVKLYP-DSRVRPTREILEFP — ARDV 553 

K-MAKLPVILGNLDITIDNVSSDFPNYVNSSYIPTKQFETCSKTPITFEVEEFVPCIPKH 560 

SLLRRLRPVTAQLKIDISPAPENPHFCLSPELLHIKPYP-DPRGRPTKEILEFP — AREV 54 0 

R-ISKMQTIPGSLDIAVDNVPLEHPNCVTSSFIPVKPFNMMAQTEPTVEVEEFVYDSTKY 658 



CYPFT] 



YV 



> DAS ALKCI YGKE AGS V 



:IYKNHLYVYPLQLKYDSQKTFAKARNIAVCVEFRDSDES 
/IPHTVYRNLLYVYPQRLNFVN — KLASARNITIKIQFMCG-EDASNAMPVIFGK^S 
YVPNTTYRNLLYIYPQSLNFAN — RQGSARNITVKVQFMYG-EDPSNAMPVX 
TQPYTIYTNHLYVYPKYLKYDSQKSFAKARNIAICIEFKDSDEEDSQPLKC 
YAPHTSYRNLLYVYPHSLNFSS — RQGSVRNLAVRVQYMTG-EDPSQALPVIFGK^SCSE 
CRPYRVYKNQIYIYPKHLKYDSQKCFNKARNITVCIEFKNSDEESAKPLKC 



FGKSSCSE 
:iYGREGGPV 



:iYGKFEGPL 



627 
GPE 541 
610 
620 
597 
718 



FTTN \YAWSHHNQNPEFYDEIKIELPIHLHQKHHLLFTFYHVSCEINTKGTTKKQDTv; 

FLQE /YTAVT YHNKS PDF YEEVKIKL PAKLT VNHHLLFTFYHI S CQ Q KQGAS ^E 

FSKE^YTAWYHNRSPDFHEEIKVKLPATLTDHHHLLFTFYHVSCQQ KQNTPLE 

FTRS !^FAAVLHHHQNPEFYDEIKIELPTQLHEKHHLLLTFFHVSCDNSSKGSTKKRDV/E 



^KE 687 
595 
664 
680 

FTREkFTPWYHNKSPEFYEEFKLHLPACVTENHHLLFTFYHVSCCjP RPGTALE 651 

FTSAf^YTAVLHHSQNPDFSDEVKIELPTQLHEKHHILFSFYHVTCqiNAKANAKKKEAILE 778 



TPVGFAWVPLLKDGRI 
TLLGYSWLP 
TPVGYTWI 



ITFEQQLPVSANLPPGYLNLNDAESRRQCNVDIKWVDGAKPLLK 
ILLNERLQTGSYCLPVALEKLPPNYSMHSAEKVPLQNPp|lKWAEGHKGVFN 
PMLQNGRLKTGQFCLPVSLEKPPQAYSVLSPEVP- 
TQVGYSWLPLLKDGRWTSEQHIPVSANLPSGHLGYQELGMGRHYGPEfrKWDGGKPLLK 



747 
655 

LPGHKWVDNHKGVFN 721 
740 

TPVGFTWIPLLQHGRLRTGPFCLPV^DQPPPSYSVLTPDVA LPGMRWVDGHKGVFS 708 

TSVGYAWLPLMKHDQIASQEYNIPIATSLPPNYLSFQDSASGKHGGSDIKWVDGGKPLFK 83 8 



FIG. 8(2 of 6) 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASPV 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP? 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCLASP7 
hCLASPl 



FKSHLESTIYTQDLHVHKFFHHCQLIQS 

lEVQAVSSVHTQDNHLEKFFTLCHSLESpVTFPIRVLDQKISEMALEHELKLS 
VEWAVSSIHTQDPYLDKFFALVNALDEfi- 
ISTHLVSTVYTQDQHLHNFFQYCQKTES 
VELTAVSSVHPQDPYLDKFFTLVHVLEE' 
VSTFWSTVNTQDPHVNAFFQECQKREKp' 



GSKEVPGELIKYLKCLHAM 7 94 

IICLNSS 715 

LFPVRIGDMRIMENNLENELKSSISALNSS 780 

GAQALGNELVKYLKSLHAM 7 87 

-AFPFRLKDTVLSEGNVEQELRASLAALRLA 7 67 

■MSQSPTSNFIRSCKNLLNVE 8 87 



EG 
S 



HVMIAFLPTILNQLF R 

EjEPLVAFSHHVLDKLVULVIRPPI 



KI)HAIMSFLPIILNQLFP' 



EllQVMIQFLPVILMQLFEl VLTNMTH EDDVP 824 

Rl EPLVLFLHLVLDKLF 2LSVQPMVIAGQTANFSQFAFESVVAIANSLHNSKDLSKDQHG 7 7 5 

QIIEPWRFLHLLLDKLlfLVIRPPVIAGQIVNLGQASFEAMASIINRLHKNLEGNHDQHG 84 0 

VLT-RAT QEEVA 816 

I S GQI VNLGRGAFEAMAHWS LVHRSLEAAQDARG 827 
VLVQWE EDEIT 916 



INCTMV-LLHIVSKCHEEGLDS YLRSFIKYS FRPEKP 860 

RNCLLASYVHYVFRLPEVQRDVPKSGAPTALLDPRSYHTYGRTSAAAVSSKLLQARVMSS 835 
RNSLLASYIHYVFRLPNTYPNSSSPG-PGGLGGSVHYATMARSAVRPASLNLNRSRSLSN 8 99 

VNVTRV-IIHVVAQCHEEGLES HLRSYVKYA YKASPY 852 

HCPQLAAYVHYAFRLPGTEPSLPDGAPP VTVQAATLARGSGRPASLYLARSKSISS 883 

TTVTRV-LPDIVAKCHEEQLDH SVQSYIKFV FKTRAC 952 
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SAPQAQLIH ETLATTMIAILKQS 

SNPDLAGTHSAADEEVKNIMSSKIADRNCSRMSYYCSGSSDAPSSPA- 



883 
882 



SNPDISGTPTSPDDEVRSIIGSKGLDRSNSWVNTGGPKAAPWGSNPSPSAESTQAMDRSC 959 

875 

915 

972 



VASEYKTVH EELTKSMTTILKPS 

SNPDLAVAPGSVDDEVSRILASKLLHEELA-LQ- 
KE RPVH EDLAKNVTGLLKSN 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP? 
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ADFLSINKLLKYS WFFFEIIAKSM 907 

APRPASKKHFHEELALQ MWSTGMVKSM 91 0 

NRMSSHTETSSFLQTLTGRLPTKKLFHEELALQWVVCSGSVRESALQQAWFFFELMVKSM 1019 

ADFLTSNKLLRYS WFFFDVLIKSM 8 99 

WWSSSAVREAILQHA WFFFQLMVKSM 942 

DSPTVKHVLKHS WFFFAIILKSM 995 
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Cadheri n Cl eavage 

ATYLLEEMKrKLptRGQPjFPETYHHVLHSLLLAIIPHVTIRYAEIPDE SRNVNYSLAS 

AQHVHNMDKRDSFRRTI^FSDRFMDDITTIVNWTSEIAALLVKPQKENEQAEKMNISLAF 

VHHLYFNDKLEAFRKSRFPERFMDDIAALVSTIASDIVSRFQKDTEM VERLNTSLAF 

AQHLIENSKVKLIRNQPFPASYHHAAETWNMLMPHITQKFGDNPEA SKNANHSLAV 

ALHLLLGQRLDTPRKLFFPGRFLDDITALVGSVGLEVITRVHKDVEL AEHLNASLAF 

AQHLIDTNKIQLPRPQRFPESYQNELDNLVMVLSDHVIWKYKDALEE TRRATHSVAR 



FLKRCLTLMDRGFIFNLINDYISGFSPKDP- 
FLYDLLSLMDRGFVF 
FLNDLLSVMDRGFVF 



-K /LAEYKFEFLQTICNHEHYIPLNL 

FLYDLLSLMDRGFVpjNLIRHYCSQLSAKLSNL P rLISMRLEFLRILCSHEHYLNLNL 

/LVSLRLDFLRIICSHEHYVTLNL 

FIKRCFTFMDRGFVBjKQIMNYISCFAPGDP K TLFEYKFEFLRWCNHEHYIPLNL 

FLSDLLSLVDRGFVFSLVRAHYKQVATRLQSSPNPA^LLTLRMEFTRILCSHEHYVTLNL 
FLKRCFTFMDRGCVF KMVNNY I SMFS SGDL K rLCQYKFDFLQEVCQHEHFI PLCL 



964 

970 

1076 

956 

999 

1052 



1019 
1027 
1136 
1011 
1059 
1107 
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Cadherin EC motif 



PMAFAKPKLQR VQDS — NLEYSLSDEYCKHHFLVGI LLRE TSI 1060 

FFMNADTAPTSP — CPSISSQNSSSCSSFQDQKIASMFDLTSEYRQQHFLTGILFTELAA 1085 

PCSLLTPPASPSPSVSSATSQSSGFSTNVQDQKIAiaMFELSVPFRQQHYLAGIVLTELAV 1196 

PMPFGKGRIQR YQDL — QLDYSLTDEFCRNHFLVGILLRE \/GT 1052 

PCCPLSPPASPSPSVSSTTSQSSTFSSQAPDPKVTSMFELSGPFRQQHFLAGILLTELAL 1119 



PIRSANIPDPLTP- 



— SES TQELHASDMPEYSVTNEFCRKHFLIGILLRE 



VCF 1157 



ALQDN YEIRYTAISVIKNLLIKHAFDTRYQHKNQQAKIAQLYLPFVGLLLENI 

ALDAEGEGISK\/QRKAVSAIHSLLSSHDLDPRCVKPEVKVKIAALYLPLVGIILDALP' 
ILDPDAEGLFGtLHKKVINMVHNLLSSHDSDPRYSDPQIKARVAMLYLPLIGIIMETv|p 
ALQEFR 

ALEPEAEGAFLtLHKKAISAVHSLLCGHDTDPRYAEATVKARVAELYLPLLSIARDTL[P 
ALQEDQ 



AGRDTLYSCA AMPN-S ASRDEFPCGFTSPANRGSLSTDKDTAYGS 1160 

QL CDFTVADTRRYRTSGSD 1162 

QLY DFTETHNQRGRPICIATDD— 127 6 

NVRDVSPFPVNAGMTVKDESLALPA-VMPLVTPQKGSTLDNSLHKDLLGAISGIASPYTT 1167 

RLH DFAEGPGQRSRLASMLDSDTE 1201 

YLKDLYPFTVNTSNQGSRDDLSTNGGFQSQTAIKHANSVDTSFSKDVLNSIAAFSSIAIS 1273 



FQ-NGHGIKREDSRGSLIPEGATGFPDQGNTGEN TRQSSTRSSVSQYNRLDQYE 1213 

EEQEGAGAINQNVALAIAGNNFNLKT SGIVLSSLPYKQYNMLNADT 12 08 

YESESGSMISQTVA>miAGTSVPQLTR PGSFLLTSTSGRQHTTFSAES 1324 

STPNIMSVRNADSRGSLISTDSGNSLPERNSEKSNSLDKHQQSSTLGNSWRCDKLDQSE 122 7 

GEGDIAGTINPSVAMAIAGGPLAPGSR ASISQGPPTASRAGCALSAES 124 9 

TVNHADSRASLASLDSNPSTNEKSSEKTDNCEKIPRPLALIGSTLRFDRLDQAE 1327 



IRSLLMCYLYIVKMISEDTLLTYWNKVSPQELINILILLEVCLFHFRYMGKRNIAPVHDA 1273 
TRNLMICFLWIMKNADQSLIRKWIADLPSTQLNRILDLLFICVLCFEYKGKQSSDi^VSTQ 1268 
SRSLLICLLWLKNADETVLQKWFTDLSVLQLNRLLDLLYLCVSCFEYKGKKVFEPMNSL 1384 
IKSLLMCFLYILKSMSDDALFTYWNKASTSELMDFFTISEVCLHQFQYMGKRYIAPNQEG 12 87 
£ RTLLACVLWVLKNTSPALLQRWATDLTLPQLGRLLDLLYLCLAAFEYKGKKAFEI? INSL 1309 
IRSLLMCFLHIMKTISYETLIAYWQRAPSPEVSDFFSILDVCLQISIFRYLGKRNIIPKIAA 1387 



WLSKHFGIDR— 
VLQKSRDVKAR- 
TFKKSKDMRAK- 
LGPIVHDRKS — 
TFKKSLDMKAR- 



— KSQTMPALRNRSGVMQARLQHLSSLESS 1311 

-LEE ALLRGEGARGEMMRRRAPGNDRFPGLNEN 1311 

-LEEAI LGS I GARQEMVRRSRGQLERS PSGSAFGSQ 1430 

QTLPVSRNRTGMMHARLQQLGSLDNS 1323 

-LEEAILGTIGARQEMVRRSRERSPFGNPEN 1350 



AFKFVQSTQNNGTLKGSKPSCQTSGLLAQWMHSTSRHEGHKQHRSQTLPIIRGKN- 



1442 



FTLNHSSTTTEi^lDIFHQALLEGNTATEVSLTVLDTISFFTQCFKTQLL 
LRWKKEQTHWRQANEKLDKTKAELDQEALISGNLATEAHLIILDMQENIIQASS- 
ENLRWRKDMTHWRQNTEKLDKSR^^IEIEHEALIDGNLATEANLIILDTLEIvlvQTVS- 
-LTFNHSYGHS 



DADVLHQSLLEANIATEVCLTALDTLSLETLAFKNQLL 
— VRWRKSVTHWKQTSDRVDKTKDEMEHEALVEGNLATEASLVVLDTLEIIVQTVM-LSE 14 07 
-ALSNPKLLQMLDNTMTSNSNEIDIVHHVDTEANIATEGCLTILDLVSLFjTQTHQRQLQ 1500 

* * * it * * * 



1359 
•ALD 1368 
•VTE 1489 
1371 
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NNDGHNPLMKKVFDIHLAFLKNGQSEVSLKHVFASLRAFISKFPSAFFKGRVNMCAAFCY 1419 

CKDS LLGGVLRVLVNSLNCDQSTTYLTHCFATLRALIAKFGDLLFEEEVEQCFDLCH 1425 

SKES ILGGVLKVLLHSMAClSfQSAVYLQHCFATQRALVSKFPELLFEEETEQCADLCL 154 6 

ADHGHNPLMKKVFDVYLCFLQKHQSETALKNVFTALRSLIYKFPSTFYEGRADMCAALCY 1431 

ARES VLGAVLKVVLYSLGSAQSALFLQHGLATQRALVSKFPELLFEEDTELCADLCL 14 64 

QCDCQNSLMKRGFDTYMLFFQVNQSATALKHVFASLRLFVCKFPSAFFQGPADLCGSFCY 1560 
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hCLASPS 
hCLASP2 
hCLASP7 
hCLASPl 



EVLKCCTSKI S STRNEASALL YLLMRNNFE YTKRKTFLRTHLQI I lAVSQLI ADVALSGG 147 9 
QVLHHCSSSMDVTRSQACATLYLLMR— FSFGATSNFARVKMQVTMSLASLVGRAPDFNE 1483 
RLLRHCSSSIGTIRSHPSASLYLLMR — QNFEIGNNFARVKMQVPMSLSSLVGTSQNFNE 1604 
EILKCCNSKLSSIRTEASQLLYFLMRNNFDYTGKKSFVRTHLQVIISVSQLIADWGIGE 14 91 
RLLRHCGSRISTIRTHASASLYLLMR — QNFEIGHNFARVKMQVTMSLSSLVGTTQNFSE 1522 
EVLKCCNHRSRSTQTEASALLYLFMRKNFEFNKQKSIVRSHLQLIKAVSQLIADAG-IGG 1619 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASPS 
hCLASP2 
hCLASP7 
hCLASPl 



SRFQESLFIINNFANSDRPMKATAFPAEVKDLTKRIRTVLMATAQMKEHEKDPEMLIDLQ 1539 
EHLRRSLRTILAYSEEDTAMQMTPFPTQVEELLClSlLNSILYDTVKMREFQEDFEMLMDLM 1543 
EFLRRSLKTILTYAEEDLELRETTFPDQVQDLVFNLHMILSDTVKMKEHQEDPEMLIDLM 1664 
TRFQQSLSIINNCANSDRLIKHTSFSSDVKDLTKRIRTVLMATAQMKEHElSfDPEMLVDLQ 1551 
EHLRRSLKTILTYAEEDMGLRDSTFAEQVQDLMFNLHMILTDTVKMKEHQEDPEMLIDLM 1582 
SRFQHSLAITNNFANGDKQMKNSNFPAEVKDLTKRIRTVLMATAQMKEHEKDPEMLVDLQ 167 9 



. ***** . * * 



transmembrane 



YSLAKSYASTPELRKTWLDSMAKIHVKNGDESEAAMCYVHVAALVAEFIHRKK- 
YRIAKSYQASPDLRLTWLQNMAEKHTKKKC^fTEAAMCLVHAAALVAEYI 
YRIAKGYQTSPE-RLTWLQNMAGKHSERSNtiAEAAQCLVHSAALVAEYI 
YS LAKS YAS T PE LRKT WL DSMAR I HVKNGD LSE AAMC YVHVTALVAE YI TRKG 
YRIARGYQGSPDLRLTWLQNMAGKHAELGNHAEAAQCMVHAAALVAEYIALLEDQ' 



. * . * * * * . 



YSLANSYASTPELRRTWLESMAKIHARNGDLSEAAMCYIHIAALIAEYIKRKGYWKVEKI 173 9 



. * * . * * . 



- 1592 

SMLEDH 1598 

SMLEDR 1718 

- 1604 

- 1637 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



LFPNGCSAFKKITPNIDEEGAMKEDAGMMD 1 622 

SYLPVGSVSFQNISSNVLEESWSEDTLSPDEDGV 1633 

KYLPVGCVTFQNISSNVLEESAVSDDWSPDEEGI 1753 

VFRQGCTAFRVITPNIDEEASMMEDVGMQD 1634 

RHLPVGCVSFQNISSNVLEESAISDDILSPDEEGF 167 2 

CTASLLSEDTHPCDSNSLLTTPSGGSMFSMGWPAFLSITPNIKEEGAAKEDSGMHD 17 95 



IT AM 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



VHYSEEVLLELLEQCVDGLWKAERYEIISEISKLIVPIYEKRREFEKLTQVYRTLHG 1679 

CAGQ YFTESGLVGLLEQAAELFSTGGLYETVNEVYKLVI PI LEAHREFRKLTLT HSKI QR 1693 
CSGKYFTESGLVGLLEQAAASFSMAGMYEAVNEVYKVLIPIHEANRDAKKLSTIHGKIQE 1813 

VHFNEDVLMELLEQCADGLWKAERYELIADI YKLII P I YEKRR 1677 

CSGKHFTELGLVGLLEQAAGYFTMGGLYEAVNEVYKNLIPILEAHRDYKKLAAVHGKIQE 17 32 
TPYNENILVEQLYMCGEFLWKSERYELIADVNKPIIAVFEKQRDFKKLSDIYYDIHR 1852 



ITAM 



DOCK motif 



DOCK motif 



ITAM 



A YTKI LEVMHTKKRLLGT FFRV AF ^GQS FFEEEDGKE^ I ^KEP KLTGLSEI SLRLVKL YG 173 9 

AFDSIVNKDH — KRMFGT YFRVGFFG-SKFGDLEEQEE V^KEP AITKLPEISHRLEAF YG 1750 

AFSKIVHQSTGWERMFGTIfFRVGFyG-TKFGDLCEQEEV^fKEPAITKLAEISHRLEGFYG 1872 

EFFEDECGKE^IifKEPKLTPLSEXSQRLLKL^S 1710 

AFTKIMHQSSGWERVFGTrFRVGFre-AHFGDLI^EQEPV^fKEPSITKLAEISHRLEEFYT 17 91 

SYLKVAEVVNSEKRLFGRirYRVAP!i:GQGFFEEEEGKE^I^KEPKLTGLSEISQRLLKLYA 1912 

* . . .* * *** ** .* 
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IT AM 



IT AM 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASPS 
hCLASP2 
hCLASPV 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCLASP? 
hCLASPl 



hCLASP4 
hCLASPS 
hCLASPS 
hCLASP2 
hCLASP7 
hCLASPl 



hCLASP4 
hCLASP5 
hCLASP3 
hCLASP2 
hCLASP7 
hCLASPl 



YFDDKELTERKTEFERNHNISRFV 



EKFGTENVKIIQDSDKVNAKELDPKYAHIQVrfVKP 
QCF'GAEFVEVIKDSTPVDKTKLDPNKAYIQir^'VEPYFDEYEMKDRVTYFEKNFNLRRFM 
ERrGEDVVEVIKDSNPVDKCKLDPNBCAYIQI T iTVEF YFDTYEMKDRITYFDKNYNLRRFM 
DKJ^GSENVKMIQDSGKVNPKDLDSKYAYIQV|r^VIPFFDEKELQERKTEFERSHNIRRFM 
ERrGDDWEIIKDSYPVDKSKLOSQKAYIQiriTVEFYFDTYELKDRVTYFDRNYGLRTFL 
DKfGADNVKIIQDSNKVNPKDLDPKYAYIQVriTVTFFFEEKEIEDRKTDFEMHHNINRFV 



1799 
1810 
1932 
1770 
1851 
1972 



IT AM 



POCK motif 



FEAPYTLSGKKQGCIEEQCKRRTILTTSNSFPfVKKRIPINCEQQINLKPIDGATDEIKD 
YTTPFTLEGRPRGELHEQYRRNTVLTTMHAFPflKTRISVIQKEEFVLTPIEVAIEDMKK 
YCTPFTLDGRAHGELHEQFKRKTILTTSHAFPflKTRVNVTHKEEIILTPIEVAIEDMQK 
FEMPFTQTGKRQGGVEEQCKRRTILTAIHCFPiTVKKRIPVMYQHHTDLNPIEVAIDEMSK 
FCTPFTPDGRAHGELPEQHKRKTLLSTDHAFPflKTRIRVCHREETVLTPVEVAIEDMQK 
FETPFTLSGKKHGGVAEQCKRRTILTTSHLFP^VKKRIQVISQSSTELNPIEVAIDEMSR 



1859 
1870 
1992 
1830 
1911 
2032 



Coiled-coil 



KTAELQKLCSSTDVDMIQLQLKLQGWSVQVNAGPLAYARAFLNDSQASKYPPKKVSELK 1919 
KTLQLAVAINQEPPDAKMLQMVLQG3VGATVNQGPLEVAQVFLAEIPADPKLYRHHNKLR 1930 
KTQELAFATHQDPADPKMLQMVLQG5VGTTVNQGPLEVAQVFLSEIPSDPKLFRHHNKLR 2052 
KVAELRQLCSSAEVDMIKLQLKLQG 3VSVQVNAGPLAYARAFLDDTNTKRYPDNKVKLLK 1890 
KTRELAFATEQDPPDAKMLQMVLQG3VGPTVNQGPLEVAQVFLAEIPEDPKLFRHHNKLR 1971 
KVSELNQLCTMEEVDMI SLQLKLQG 3VSVKVNAGPMAYARAFLEETNAKKYPDNQVKLLK 2092 



Coiled-coil 



DMFRKFIQACSIALELNERLIKEDQVEYHEGLKSNFRDMVKELSDIIHEQILQEDTMHSP 197 9 
LCFKEFIMRCGE?^VEKNKRLITADQREYQQELKKNYNKLKENLRPMIERKIPELYKPIFR 19 90 

LCFKDFTKRCED ALRKNKSLIGPVQKEYQRELGKLSS P 2090 

EVFRQFVEACGQALAVNERLIKEDQLEYQEEMKANYREMAKSLSEiyiHEQICPLEEKTS- 194 9 
LCFKDFCKKCEDALRKNKALIGPDQKEYHRELERNYCRLREALQPILTQRLPQLMAPTP- 2030 
EIFRQFADACGQALDVNERLIKEDQLEYQEELRSHYKDMLSELSTVyiNEQITGRDDLSKR 2152 



PDZ liga nd 
WMSNTLHVFCAISGT5SDRGYGSPR |YAEV| — 2008 
VESQKRDSFHRSSFRKCETQLSQGS 



2015 



VLPNSLHIFNAISGTPTSTMVHGMTSS ^SWI 1980 

— PGLRNSLNRASFRKADL 20 47 

GVDQTCTRVI SKAT PALPTVS 1 5 S |5 AEV| 2180 
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